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METHOD FOR DETECTION OF DRUG-SELECTCD-MLJTAT.IQNS_, 
" IN THE HIV PROTEASE GEN E ~" 

^1. FIELD OF THE INVENTION 

The present invention relates to the field of HIV diagnosis. More particularly, the present 
invention relates to the field of diagnosing the susceptibility of an HIV sample to antiviral drugs used 
to treat HIV infection. 

The present invention relates to a method for the rapid and reliable detection of drug-selected 
mutations in the HIV protease gene allowing the simultaneous characterization of a range of codons 
involved in drug resistance using specific sets of probes optimized to function together in a reverse- 
hybridization assay. 

2. BACKGROUND OF THE INVENTION 

The human immunodeficiency virus (HIV) is the ethiological agent for the acquired 
immunodeficiency syndrome (AIDS). HIV, like other retroviruses, encodes an aspartic protease that 
mediates the maturation of the newly produced viral particle by cleaving viral polypeptides into their 
functional forms (Hunter et al). The HIV protease is a dimeric molecule consisting of two identical 
subunits each contributing a catalytic aspartic residue (Navia et al, Whodawer et al, Meek et al). 
Inhibition of this enzyme gives rise to noninfectious viral particles that cannot establish new cycles of 
viral replication (Kohl et al, Peng et al). 

Attempts to develop inhibitors of HIV- 1 protease were initially based on designing peptide 
compounds that mimicked the natural substrate. The availability of the 3-dimensional structure of the 
enzyme have more recently allowed the rational design of protease inhibitors (PI) using computer 
modeling (Huff et al, Whodawer et al). A number of second generation PI that are partially peptidic or 
entirely nonpeptidic have proven to exhibit particularly potent antiviral effects in cell culture. 
Combinations of various protease inhibitors with nucleoside and non-nucleoside RT inhibitors have also 
been studied extensively in vitro. In every instance, the combinations have been at least additive and 
usually synergistic. 

In spite of the antiviral potency of many recently developed HIV-1 PI, the emergence of virus 
variants with decreased sensitivity to these compounds has been described both in cell culture and in 
treated patients thereby escaping the inhibitory effect of the antiviral (Condra et al.). Emergence of 
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resistant variants depends on the selective pressure applied to the viral population. In the case of a 
relatively ineffective drug, selective pressure is low because replication of both wild-type virus and any 
variants can continue. If a more effective drug suppresses replication of virus except for a resistant 
variant, then that variant will be selected. Virus variants that arise from selection by PI carry several 
distinct mutations in the protease coding sequence that appear to emerge sequentially. A number of these 
cluster near the active site of the enzyme while others are found at distant sites. This suggests 
conformational adaptation to primary changes in the active site and in this respect certain mutations that 
increase resistance to PI also decrease protease activity and virus replication. 

Amongst the PI, the antiviral activity of the PI ritonavir (A-75925; ABT-538), nelfinavir (AG- 
1343), indinavir (MK-639; L735; L524) and saquinavir (Ro 31-8959) have been approved by the Food 
and Drug Administration and are currently under evaluation in clinical trials involving HIV-infected 
patients. The VX-487 (141 W94) antiviral compound is not yet approved. The most important mutations 
selected for the above compounds and leading to gradually increasing resistance are found at amino acid 
(aa) positions 30 (D to N), 46 (M to I), 48 (G to V), 50 (I to V), 54 (I to A, I to V), 82 (V to A, or F, or 
I, or T), 84 (I to V) and 90 (L to M). Other mutations associated with drug resistance to the mentioned 
compounds have been described (Schinazi et al). Saquinavir-resistant variants, which usually carry 
mutations at amino acid positions 90 and/or 48, emerge in approximately 45% of patients after 1 year 
of monotherapy. Resistance appears to develop less frequently with higher doses of saquinavir. 
Resistance to indinavir and ritonavir requires multiple mutations; usually at greater than 3 and up to 1 1 
sites, with more amino acid substitutions conferring higher levels of resistance. Resistant isolates usually 
carry mutations at codons 82, 84, or 90. In the case of ritonavir, the mutation at codon 82 appears first 
in most patients. Although mutant virions resistant to saquinavir are not cross-resistant to indinavir or 
ritonavir, isolates resistant to indinavir are generally ritonavir resistant and visa versa. Resistance to 
either indinavir or ritonavir usually results in cross-resistance to saquinavir. Approximately one third 
of indinavir resistant isolates are cross-resistant to nelfinavir as well. 

The regime for an efficient antiviral treatment is currently not clear at all. Patterns of reduced 
susceptibility to HIV protease inhibitors have been investigated in vitro by cultivating virus in the 
presence of PI. These data, however, do not completely predict the pattern of amino-acid changes 
actually seen in patients receiving PI. Knowledge of the resistance and cross-resistance patterns should 
facilitate selection of optimal drug combinations and selection of sequences with non-overlapping 
resistance patterns. This would delay the emergence of cross-resistant viral strains and prolong the 
duration of effective antiretroviral activity in patients. Therefore, there is need for methods and systems 
that detect these mutational events in order to give a better insight into the mechanisms of HIV 
resistance. Further, there is need for methods and systems which can provide data important for the 
antiviral therapy to follow in a more time-efficient and economical manner compared to the conventional 
cell-culture selection techniques. 
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3. AIMS OF THE INVENTION 

It is an aim of the present invention to develop a rapid and reliable detection method for 
5 determination of the antiviral drug resistance of viruses, which contain protease genes such as HIV 
retroviruses present in a biological sample. 

More particularly it is an aim of the present invention to provide a genotyping assay allowing 
the detection of the different HIV protease gene wild type and mutation codons involved in the antiviral 
resistance in one single experiment. 
10 It is also an aim of the present invention to provide an HTV protease genotyping assay or method 

which allows to infer the nucleotide sequence at codons of interest and/or the amino acids at the codons 
of interest and/or the antiviral drug selected spectrum, and possibly also infer the HIV type or subtype 
y;3 isolate involved. 

n j Even more particularly it is an aim of the present invention to provide a genotyping assay 

C;3 15 allowing the detection of the different HIV protease gene polymorphisms representing wild-type and 

j mutation codons in one single experimental setup, 

ifi It is another aim of the present invention to select particular probes able to discriminate wild- 

ly type HIV protease sequences from mutated or polymorphic HTV protease sequences conferring resistance 

CI1 to one or more antiviral drugs, such as ritonavir (A-75925; ABT-538), nelfmavir (AG-1343), indinavir 

20 (MK-639; L735; L524), saquinavir (Ro 31-8959) and VX-478 (141W94) or others (Shinazi et at). 
£3 It is more particularly an aim of the present invention to select particular probes able to 

r f discriminate wild-type HIV protease sequences from mutated or polymorphic HIV protease sequences 

conferring resistance to ritonavir (A-75925; ABT-538). 

It is more particularly an aim of the present invention to select particular probes able to 
25 discriminate wild-type HIV protease sequences from mutated HIV protease sequences conferring 
resistance to nelfinavir (AG-1343). 

It is more particularly an aim of the present invention to select particular probes able to 
discriminate wild-type HIV protease sequences from mutated HTV protease sequences conferring 
resistance to indinavir (MK-639; L735; L524). 
30 It is more particularly an aim of the present invention to select particular probes able to 

discriminate wild-type HIV protease sequences from mutated HIV protease sequences conferring 
resistance to saquinavir (Ro 3 1-8959). 

It is more particularly an aim of the present invention to select particular probes able to 
discriminate wild-type HIV protease sequences from mutated HIV protease sequences conferring 
35 resistance to VX-478 (141 W94). 

It is also an aim of the present invention to select particular probes able to determine and/or infer 
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cross-resistance to HIV protease inhibitors. 

It is more particularly an aim of the present invention to select particular probes able to 
discriminate wild-type HIV protease from mutated HIV protease sequences involving at least one of 
amino acid positions 30 (D to N), 46 (M to I), 48 (G to V), 50 (I to V), 54 (I to A or V), 82 (V to A or 
F or I or T), 84(1 to V) and 90 (L to M) of the viral protease gene. 

It is particularly an aim of the present invention to select a particular set of probes, able to 
discriminate wild-type HIV protease sequences from mutated HIV protease sequences conferring 
resistance to any of the antiviral drugs defined above with this particular set of probes being used in a 
reverse hybridization assay. 

It is moreover an aim of the present invention to combine a set of selected probes able to 
discriminate wild-type HIV protease sequences from mutated HIV protease sequences conferring 
resistance to antiviral drugs with another set of selected probes able to identify the HIV isolate, type or 
yg subtype present in the biological sample, whereby all probes can be used under the same hybridization 

and wash-conditions. 

MJ 

C3 15 It is also an aim of the present invention to select primers enabling the amplification of the gene 

i7| fragment(s) determining the antiviral drug resistance trait of interest. 

V\ It is also an aim of the present invention to select particular probes able to identify mutated HIV 

protease sequences resulting in cross-resistance to antiviral drugs. 

The present invention also aims at diagnostic kits comprising said probes useful for developing 
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SI 

^ 20 such a genotyping assay. 



The present invention also aims at diagnostic kits comprising said primers useful for developing 
such a genotyping assay. 



25 4. DETAILED DESCRIPTION OF THE INVENTION. 

All the aims of the present invention have been met by the following specific embodiments. 

According to one embodiment, the present invention relates to a method for determining the 
susceptibility to antiviral drugs of HIV viruses in a biological sample, with said method comprising: 
30 a) if need be, releasing, isolating or concentrating the polynucleic acids present in the sample; 

b) if need be amplifying the relevant part of the protease gene of HIV with at least one suitable primer 
pair; 

c) hybridizing the polynucleic acids of step a) or b) with at least one of the following probes: 

probes specifically hybridizing to a target sequence comprising codon 30; 
35 probes specifically hybridizing to a target sequence comprising codon 46 and/or 48; 

probes specifically hybridizing to a target sequence comprising codon 50; 
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probes specifically hybridizing to a target sequence comprising codon 54; 
probes specifically hybridizing to a target sequence comprising codon 82 and/or 84; 
probes specifically hybridizing to a target sequence comprising codon 90; 
or the complement of said probes , 

further characterized in that said probes specifically hybridize to any of the target 
sequences presented in figure 1, or to the complement of said target sequences : 
d) inferring from the result of step c) whether or not a mutation giving rise to drug resistance is 
present in any of said target sequences. 
The numbering of HIV- 1 protease gene encoded amino acids is as generally accepted in literature. 
Mutations that give rise to an amino acid change at position 48 or 90 are known to confer resistance to 
saquinavir (Erlebe et al; Tisdale et al). An amino acid change at codon 46 or 54 or 82 or 84 results in 
ritonavir and indinavir resistance (Kempf et al; Emini et al; Condra et al). Amino acid changes at 
positions 30 and 46 confer resistance to nelfmavir (Patick et al) an<j amino acid changes at position 50 
confers resistance to VX-487 (Rao et al). Therefore, the method described above allows to determine 
whether a HIV strain is susceptible or resistant to any of the drugs mentioned above. This method can 
be used, for instance, to screen for mutations conferring resistance to any of the mentioned drugs before 
initiating therapy. This method may also be used to screen for mutations that may arise during the course 
of therapy (i.e. monitoring of drug therapy). It is obvious that this method may also be used to determine 
resistance to drugs other than the above-mentioned drugs, provided that resistance to these other drugs 
is linked to mutations that can be detected by use of this method. This method may also be used for the 
specific detection of polymorphic nucleotides. It is to be understood that the said probes may only partly 
overlap with the targets sequences of figure 1, table 2 and table 3, as long as they allow for specific 
detection of the relevant polymorphic nucleotides as indicated above. The sequences of figure 1, table 
2 and table 3 were derived from polynucleic acid fragments comprising the protease gene. These 
fragments were obtained by PCR amplification and were inserted into a cloning vector and sequence 
analyzed as described in example 1 . It is to be noted that some polynucleic acid fragments comprised 
polymorphic nucleotides in their sequences, which have not been previously disclosed. These novel 
polymorphic nucleotide sequences are represented in table 4 below, 

TABLE 4: Polymorphic nucleotide sequences. 



51 


52 


53 


54 


55 


56 


57 


58 


codon position 


gga 


ggt 


ttt 


ate 


aaa 


gta 


aga 


cag 


consensus sequenc 


GGA 


GGT 


TTT 


ATC 


AAA 


GTC 


AGA 


CAA 


SEQ ID NO 478 


GGA 


GGT 


TTC 


ATT 


AAG 


GTA 


AAA 


CAG 


SEQ ID NO 479 


GGA 


GGT 


TTT 


ATT 


AAG 


GTA 


AGA 


CAG 


SEQ ID NO 48 0 
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GGA 


GGT 


TTT 


ATT 


AAA 


GTA 


AGA 


CAA 




GGA 


GGC 


TTT 


ATC 


AAA 


GTA 


AGA 


CAA 




GGA 


GGT 


TTT 


ATC 


AAA 


GTC 


AGA 


CAA 




78 


79 


80 


81 


82 


83 


84 


85 




gga 


CCt 


aca 


CCt 


gtc 


aac 


ata 


att 


gg 


GGA 


CCT 


ACA 


CCG 


GTC 


AAC 


ATA 


ATT 


GG 


GGA 


CCT 


ACA 


CCT 


GCC 


AAT 


ATA 


ATT 


GG 


GGA 


CCT 


ACG 


CCC 


TTC 


AAC 


ATA 


ATT 


GG 


GGA 


CCG 


ACA 


CCT 


GTC 


ACC 


ATA 


ATT 


GG 


GGA 


CCT 


ATA 


CCT 


GTC 


AAC 


ATA 


ATT 


GG 



87 88 89 90 91 92 93 94 
a aga aat ctg ttg act cag att ggc 

A AAA AAT CTG ATG ACT CAG ATT GGC 
A AGA ACT CTG TTG ACT CAG CTT GGA 
A AGA AAT ATG ATG ACC CAG CTT GGC 
A AGA AAT ATA ATG ACT CAG CTT GGA 
A AGA AAT CTG CTG ACT CAG ATT GGG 
A AGA AAT CTG TTG ACA CAG CTT GGC 
A AGA AAT ATG TTG ACT CAG CTT GGT 
A AGA AAT TTG TTG ACT CAG ATT GGG 
A AGA AAT ATG TTG ACT CAG CTT GGT 
A AGA AAT ATG TTG ACT CAG CTT GGA 
A AGA AAT CTG TTG ACT CAG CTT GGA 
A AGA AAC CTG TTG ACT CAA CTT GGT 



SEQ ID NO 481 
SEQ ID NO 4 82 
SEQ ID NO 483 

codon position 
consensus sequence 

SEQ ID NO 4 84 
SEQ ID NO 485 
SEQ ID NO 486 
SEQ ID NO 4 87 
SEQ ID NO 488 

codon position 
consensus sequence 

SEQ ID NO 489 
SEQ ID NO 4 90 
SEQ ID NO 491 
SEQ ID NO 4 92 
SEQ ID NO 4 93 
SEQ ID NO 4 94 
SEQ ID NO 4 95 
SEQ ID NO 496 
SEQ ID NO 4 97 
SEQ ID NO 498 
SEQ ID NO 499 
SEQ ID NO 500 



The present invention thus also relates to these novel sequences, or a fragment thereof, wherein said 
fragment consists of at least 10, preferably 15, even more preferably 20 contiguous nucleotides and 
contains at least one polymorphic nucleotide. It is furthermore to be understood that these new 
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polymorphic nucleotides may also be expected to arise in another sequence context than in the mentioned 
sequences. For instance a G at the third position of codon 55 is shown in SEQ ID N° 478 in 
combination with a T at the third position of codon 54, but a G at the third position of codon 55 may also 
be expected to occur in the context of a wild type sequence. It is also to be understood that the above 
mentioned specifications apply to the complement of the said target sequences as well. This applies also 
to Figure 1 . 

According to a preferred embodiment the present invention relates to a method as indicated 
above, further characterized in that said probes are capable of simultaneously hybridizing to their 
respective target regions under appropriate hybridization and wash conditions allowing the detection of 
the hybrids formed. 

According to a preferred embodiment, step c is performed using a set of at least 2, preferably at 
least 3, more preferably at least 4 and most preferably at least 5 probes meticulously designed as such 
that they show the desired hybridization results. In general this method may be used for any purpose that 
relies on the presence or absence of mutations that can be detected by this method, e.g. for genotyping. 
The probes of table 1 have been optimized to give specific hybridization results when used in a LiPA 
assay (see below), as described in examples 2 and 3. These probes have thus also been optimized to 
simultaneously hybridize to their respective target regions under the same hybridization and wash 
conditions allowing the detection of hybrids. The sets of probes for each of the codons 30, 46/48, 50, 
54 and 82/84 have been tested experimentally as described in examples 2 and 3. The reactivity of the 
sets shown in table 1 with 856 serum samples from various geographic origins was evaluated. It was 
found that the sets of probes for codons 30, 46/48, 50, 54 and 82/84 reacted with 98.9%, 99.6%, 98.5%, 
99.2%, 95.4% and 97.2% of the test samples, respectively. The present invention thus also relates to the 
sets of probes for codons 30, 46/48, 50, 54, 82/84 and 90, shown in table 1 and table 7. 

According to another even more preferred embodiment, the present invention relates to a method 
as defined above, further characterized in that: 
step b) comprises amplifying a fragment of the protease gene with at least one 5 '-primer 
specifically hybridizing to a target sequence located between nucleotide position 210 and 
nucleotide position 260 (codon 87), more preferably between nucleotide position 220 and 
nucleotide position 260 (codon 87), more preferably between nucleotide position 230 and 
nucleotide position 260 (codon 87), even more preferably at nucleotide position 241 to 
nucleotide position 260 (codon 87) in combination with at least one suitable 3- 
primer, and 

step c) comprises hybridizing the polynucleic acids of step a) or b) with at least one of the probes 
specifically hybridizing to a target sequence comprising codon 90. 

According to another even more preferred embodiment, the present invention relates to a method 
as defined above, further characterized in that: 
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step b) comprises amplifying a fragment of the protease gene with at least one 3-primer 
specifically hybridizing to a target sequence located between nucleotide position 253 (codon 85) 
and nucleotide positions 300, more preferably between nucleotide position 253 (codon 85) 
and nucleotide positions 290, more preferably between nucleotide position 253 (codon 85) 
5 and nucleotide positions 280, even more preferably at nucleotide position 253 (codon 85) 

to nucleotide position 273 (codon 91), in combination with at least one suitable 5 -primer, and 
step c) comprises hybridizing the polynucleic acids of step a) or b) with at least one of the probes 
specifically hybridizing to a target sequence comprising any of codons 30, 46, 48, 50, 52, 54, 82 and 
84. 

10 It has been found, unexpectedly, that an amplified nucleic acid fragment comprising all of the above- 
mentioned codons, does not hybridize optimally to probes comprising codon 82, 84 or 90. On the other 
p «! hand, a shorter fragment, for instance the fragment which is amplified by use of the primers Prot41bio 

j0 and Prot6bio with respectively seq id no 5 and seq id no 4, hybridizes better to probes comprising codon 

"""4 

py 90. Better hybridization is also obtained when the fragment is amplified with primer Prot41bio in 

^j; 1 5 combination with primers Prot6abio, Prot6bbio, Prot6cbio and Prot6dbio The present invention thus also 
iy relates to a method as defined above, further characterized in that the S'-primer is seq id no 5 and at least 

y ' one 3 " primer is chosen from seq id no 4, seq id no506, seq id no 507, seq id no 508, and seq id no 509. 

Q Likewise, another shorter fragment, for instance the fragment which is amplified by use of the primers 

*f j Prot2bio and Prot3 lbio with respectively seq id no 3 and seq id no 6 , was found to hybridize better to 

y£j 20 probes comprising codon 82 and/or 84. Hence the present invention also relates to a method as defined 
above, further characterized in that the 5 -primer is seq id no 5 and at least one 3 -primer is chosen from 
seq id no 4, seq id no506, seq id no 507, seq id no 508, and seq id no 509.. 

New sets of amplification primers as mentioned in example 1 were selected. The present invention thus 
also relates to primers: prot 16 (SEQ ID NO 501), prot 5 (SEQ ED NO 5), prot2a bio (SEQ ID NO 503), 

25 prot2b bio (SEQ ID NO 504), prot31 bio (SEQ ID NO 6), prot41-bio (SEQ ID NO 505), prot6a (SEQ 
ID NO 506), prot6b (SEQ ID NO 507), prot6c (SEQ ID NO 508) and prot6d (SEQ ID NO 509). A 
number of these primers are chemically modified (biotinylated), others are not. The present invention 
relates to any of the primers mentioned, primers containing unmodified nucleotides, or primers 
containing modified nucleotides. 

30 Different techniques can be applied to perform the sequence-specific hybridization methods of the 
present invention. These techniques may comprise immobilizing the amplified HIV polynucleic acids 
on a solid support and performing hybridization with labeled oligonucleotide probes. HIV polynucleic 
acids may also be immobilized on a solid support without prior amplification and subjected to 
hybridization. Alternatively, the probes may be immobilized on a solid support and hybridization may 

35 be performed with labeled HIV polynucleic acids, preferably after amplification. This technique is called 
reverse hybridization. A convenient reverse hybridization technique is the line probe assay (LiPA). This 
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assay uses oligonucleotide probes immobilized as parallel lines on a solid support strip (Stuyver et al., 
1993). It is to be understood that any other technique based on the above-mentioned methods is also 
covered by the present invention. 

According to another preferred embodiment, the present invention relates to any of the probes 
mentioned above and/or to any of the primers mentioned above, with said primers and probes being 
designed for use in a method for determining the susceptibility to antiviral drugs of HIV viruses in a 
sample. According to an even more preferred embodiment, the present invention relates to the probes 
with seq id no 7 to seq id no 477 and seq id no510 to seq id no 519, more preferably to the seq id no 
mentioned in Table 1 and Table 7, and to the primers with seq id no 3, 4, 5 and 6, 501, 502, 503, 504, 
505, 506, 507, 508 and 509. The skilled man will recognize that addition or deletion of one or more 
nucleotides at their extremities may adapt the said probes and primers. Such adaptations may be required 
if the conditions of amplification or hybridization are changed, or if the amplified material is RNA 
instead of DNA, as is the case in the NASBA system. 

According to another preferred embodiment, the present invention relates to a diagnostic kit 
enabling a method for determining the susceptibility to antiviral drugs of HTV viruses in a biological 
sample, with said kit comprising: 

a) when appropriate, a means for releasing, isolating or concentrating the polynucleic acids present 
in said sample; 

b) when appropriate, at least one of the primers of any of claims 4 to 6; 

c) at least one of the probes of any of claims 1 to 3 , possibly fixed to a solid support; 

d) a hybridization buffer, or components necessary for producing said buffer; 

e) a wash solution, or components necessary for producing said solution; 

f) when appropriate, a means for detecting the hybrids resulting from the preceding hybridization; 
h) when appropriate, a means for attaching said probe to a solid support. 

DEFINITIONS 

The following definitions serve to illustrate the terms and expressions used in the present invention. 

The term "antiviral drugs" refers particularly to any antiviral protease inhibitor. Examples of 
such antiviral drugs and the mutation they may cause in the HIV protease gene are disclosed in Schinazi 
et al., 1997. The contents of the latter two documents particularly are to be considered as forming part 
of the present invention. The most important antiviral drugs focussed at in the present invention are 
disclosed in Tables 1 to 2. 

The target material in the samples to be analyzed may either be DNA or RNA, e.g.: genomic 
DNA, messenger RNA, viral RNA or amplified versions thereof. These molecules are also termed 
polynucleic acids. 

It is possible to use genomic DNA or RNA molecules from HIV samples in the methods 
according to the present invention. 
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Well-known extraction and purification procedures are available for the isolation of RNA or 
DNA from a sample (fi. in Maniatis et al., Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold 
Spring Harbour Laboratory Press (1989)). 

The term "probe" refers to single stranded sequence-specific oligonucleotides, which have a 
sequence, which is complementary to the target sequence to be detected. 

The term "target sequence" as referred to in the present invention describes the wild type 
nucleotide sequence, or the sequence comprising one or more polymorphic nucleotides of the protease 
gene to be specifically detected by a probe according to the present invention. This nucleotide sequence 
may encompass one or several nucleotide changes. Target sequences may refer to single nucleotide 
positions, codon positions, nucleotides encoding amino acids or to sequences spanning any of the 
foregoing nucleotide positions. In the present invention said target sequence often includes one or two 
variable nucleotide positions. 

The term "polymorphic nucleotide" indicates a nucleotide in the protease gene of a particular 
HIV virus that is different from the nucleotide at the corresponding position in at least one other HIV 
virus. The polymorphic nucleotide may or may not give rise to resistance to an antiviral drug. 
It is to be understood that the complement of said target sequence is also a suitable target sequence in 
some cases. The target sequences as defined in the present invention provide sequences which should 
be complementary to the central part of the probe which is designed to hybridize specifically to said 
target region. 

The term "complementary" as used herein means that the sequence of the single stranded probe 
is exactly the (inverse) complement of the sequence of the single-stranded target, with the target being 
defined as the sequence where the mutation to be detected is located. 

"Specific hybridization" of a probe to a target sequence of the HIV polynucleic acids means that 
said probe forms a duplex with part of this region or with the entire region under the experimental 
conditions used, and that under those conditions said probe does not form a duplex with other regions 
of the polynucleic acids present in the sample to be analyzed. 

Since the current application requires the detection of single basepair mismatches, very stringent 
conditions for hybridization are required, allowing in principle only hybridization of exactly 
complementary sequences. However, variations are possible in the length of the probes (see below), and 
it should be noted that, since the central part of the probe is essential for its hybridization characteristics, 
possible deviations of the probe sequence versus the target sequence may be allowable towards head and 
tail of the probe, when longer probe sequences are used. These variations, which may be conceived from 
the common knowledge in the art, should however always be evaluated experimentally, in order to check 
if they result in equivalent hybridization characteristics than the exactly complementary probes. 

Preferably, the probes of the invention are about 5 to 50 nucleotides long, more preferably from 
about 10 to 25 nucleotides. Particularly preferred lengths of probes include 10, 1 1, 12, 13, 14, 15, 16, 17, 
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18, 19, 20, 21, 22, 23, 24 or 25 nucleotides. The nucleotides as used in the present invention may be 
ribonucleotides, deoxyribonucleotides and modified nucleotides such as inosine or nucleotides containing 
modified groups, which do not essentially alter their hybridization characteristics. 

Probe sequences are represented throughout the specification as single stranded DNA 
oligonucleotides from the 5' to the 3' end. It is obvious to the man skilled in the art that any of the below- 
specified probes can be used as such, or in their complementary form, or in their RNA form (wherein 
U replaces T). 

The probes according to the invention can be prepared by cloning of recombinant plasmids 
containing inserts including the corresponding nucleotide sequences, if need be by cleaving the latter out 
from the cloned plasmids upon using the adequate nucleases and recovering them, e.g. by fractionation 
according to molecular weight. The probes according to the present invention can also be synthesized 
chemically, for instance by the conventional phospho-triester method. 

The term "solid support" can refer to any substrate to which an oligonucleotide probe can be 
coupled, provided that it retains its hybridization characteristics and provided that the background level 
of hybridization remains low. Usually the solid substrate will be a microtiter plate, a membrane (e.g. 
nylon or nitrocellulose) or a microsphere (bead) or a chip. Prior to application to the membrane or 
fixation it may be convenient to modify the nucleic acid probe in order to facilitate fixation or improve 
the hybridization efficiency. Such modifications may encompass homopolymer tailing, coupling with 
different reactive groups such as aliphatic groups, NH 2 groups, SH groups, carboxylic groups, or 
coupling with biotin, haptens or proteins. 

The term "labeled" refers to the use of labeled nucleic acids. Labeling may be carried out by the 
use of labeled nucleotides incorporated during the polymerase step of the amplification such as illustrated 
by Saiki et al. (1988) or Bej et al. (1990) or labeled primers, or by any other method known to the person 
skilled in the art. The nature of the label may be isotopic ( 32 P, 35 S, etc.) or non-isotopic (biotin, 
digoxigenin, etc.). 

The term "primer" refers to a single stranded oligonucleotide sequence capable of acting as a 
point of initiation for synthesis of a primer extension product, which is complementary to the nucleic 
acid strand to be copied. The length and the sequence of the primer must be such that they allow to prime 
the synthesis of the extension products. Preferably the primer is about 5-50 nucleotides long. Specific 
length and sequence will depend on the complexity of the required DNA or RNA targets, as well as on 
the conditions of primer use such as temperature and ionic strength. 

The term "primer pair" refers to a set of primers comprising at least one 5' primer and one 3' 
primer. The primer pair may consist of more than two primers, the complexity of the number of primers 
will depend on the hybridization conditions, variability of the sequences in the regions to be amplified 
and the target sequences to be detected. 

The fact that amplification primers do not have to match exactly with the corresponding template 
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sequence to warrant proper amplification is amply documented in the literature (Kwok et al., 1990). 

The amplification method used can be either polymerase chain reaction (PCR; Saiki et al., 1988), 
ligase chain reaction (LCR; Landgren et al., 1988; Wu & Wallace, 1989; Barany, 1991), nucleic acid 
sequence-based amplification (NASBA; Guatelli et al., 1990; Compton, 1991), transcription-based 
amplification system (TAS; Kwoh et al., 1989), strand displacement amplification (SDA; Duck, 1990) 
or amplification by means of QJ3 replicase (Lomeli et al., 1989) or any other suitable method to amplify 
nucleic acid molecules known in the art. 

The oligonucleotides used as primers or probes may also comprise nucleotide analogues such 
as phosphorothiates (Matsukura et al., 1987), alkylphosphorothiates (Miller et al., 1979) or peptide 
nucleic acids (Nielsen et al., 1991; Nielsen et al., 1993) or may contain intercalating agents (Asseline et 
al., 1984). 

As most other variations or modifications introduced into the original DNA sequences of the 
invention these variations will necessitate adaptations with respect to the conditions under which the 
oligonucleotide should be used to obtain the required specificity and sensitivity. However the eventual 
results of hybridization will be essentially the same as those obtained with the unmodified 
oligonucleotides. 

The introduction of these modifications may be advantageous in order to positively influence 
characteristics such as hybridization kinetics, reversibility of the hybrid-formation, biological stability 
of the oligonucleotide molecules, etc. 

The "sample" may be any biological material taken either directly from the infected human being 
(or animal), or after culturing (enrichment). Biological material may be e.g. expectorations of any kind, 
broncheolavages, blood, skin tissue, biopsies, sperm, lymphocyte blood culture material, colonies, liquid 
cultures, fecal samples, urine etc. 

The sets of probes of the present invention will include at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12 
13, 14, 15, 16, 17, 18, 19, 20,21,22,23,24, 25,26, 27,28,29, 30 or more probes. Said probes may be 
applied in two or more distinct and known positions on a solid substrate. Often it is preferable to apply 
two or more probes together in one and the same position of said solid support. 

For designing probes with desired characteristics, the following useful guidelines known to the 
person skilled in the art can be applied. 

Because the extent and specificity of hybridization reactions such as those described herein are 
affected by a number of factors, manipulation of one or more of those factors will determine the exact 
sensitivity and specificity of a particular probe, whether perfectly complementary to its target or not. The 
importance and effect of various assay conditions, explained further herein, are known to those skilled 
in the art. 

The stability of the [probe : target] nucleic acid hybrid should be chosen to be compatible with 
the assay conditions. This may be accomplished by avoiding long AT-rich sequences, by terminating the 
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hybrids with G:C base pairs, and by designing the probe with an appropriate Tm. The beginning and end 
points of the probe should be chosen so that the length and %GC result in a Tm about 2-10°C higher than 
the temperature at which the final assay will be performed. The base composition of the probe is 
significant because G-C base pairs exhibit greater thermal stability as compared to A-T base pairs due 
to additional hydrogen bonding. Thus, hybridization involving complementary nucleic acids of higher 
G-C content will be stable at higher temperatures. 

Conditions such as ionic strength and incubation temperature under which a probe will be used 
should also be taken into account when designing a probe. It is known that hybridization will increase 
as the ionic strength of the reaction mixture increases, and that the thermal stability of the hybrids will 
increase with increasing ionic strength. On the other hand, chemical reagents, such as formamide, urea, 
DMSO and alcohols, which disrupt hydrogen bonds, will increase the stringency of hybridization. 
Destabilization of the hydrogen bonds by such reagents can greatly reduce the Tm. In general, optimal 
hybridization for synthetic oligonucleotide probes of about 10-50 bases in length occurs approximately 
5°C below the melting temperature for a given duplex. Incubation at temperatures below the optimum 
may allow mismatched base sequences to hybridize and can therefore result in reduced specificity. 

It is desirable to have probes, which hybridize only under conditions of high stringency. Under 
high stringency conditions only highly complementary nucleic acid hybrids will form; hybrids without 
a sufficient degree of complementarity will not form. Accordingly, the stringency of the assay conditions 
determines the amount of complementarity needed between two nucleic acid strands forming a hybrid. 
The degree of stringency is chosen such as to maximize the difference in stability between the hybrid 
formed with the target and the nontarget nucleic acid. In the present case, single base pair changes need 
to be detected, which requires conditions of very high stringency. 

The length of the target nucleic acid sequence and, accordingly, the length of the probe sequence 
can also be important. In some cases, there may be several sequences from a particular region, varying 
in location and length, which will yield probes with the desired hybridization characteristics. In other 
cases, one sequence may be significantly better than another that differs merely by a single base. While 
it is possible for nucleic acids that are not perfectly complementary to hybridize, the longest stretch of 
perfectly complementary base sequence will normally primarily determine hybrid stability. While 
oligonucleotide probes of different lengths and base composition may be used, preferred oligonucleotide 
probes of this invention are between about 5 to 50 (more particularly 10-25) bases in length and have a 
sufficient stretch in the sequence which is perfectly complementary to the target nucleic acid sequence. 

Regions in the target DNA or RNA, which are known to form strong internal structures 
inhibitory to hybridization, are less preferred. Likewise, probes with extensive self-complementarity 
should be avoided. As explained above, hybridization is the association of two single strands of 
complementary nucleic acids to form a hydrogen bonded double strand. It is implicit that if one of the 
two strands is wholly or partially involved in a hybrid that it will be less able to participate in formation 
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of a new hybrid. There can be intramolecular and intermolecular hybrids formed within the molecules 
of one type of probe if there is sufficient self complementarity. Such structures can be avoided through 
careful probe design. By designing a probe so that a substantial portion of the sequence of interest is 
single stranded, the rate and extent of hybridization may be greatly increased. Computer programs are 
available to search for this type of interaction. However, in certain instances, it may not be possible to 
avoid this type of interaction. 

Standard hybridization and wash conditions are disclosed in the Materials & Methods section 
of the Examples. Other conditions are for instance 3X SSC (Sodium Saline Citrate), 20% deionized FA 
(Formamide) at 50°C. 

Other solutions (SSPE (Sodium saline phosphate EDTA), TMAC1 (Tetramethyl ammonium 
Chloride), etc.) and temperatures can also be used provided that the specificity and sensitivity of the 
probes is maintained. If need be, slight modifications of the probes in length or in sequence have to be 
carried out to maintain the specificity and sensitivity required under the given circumstances. 

Primers may be labeled with a label of choice (e.g. biotin). Different primer-based target 
amplification systems may be used, and preferably PCR-amplification, as set out in the examples. Single- 
round or nested PCR may be used. 

The term "hybridization buffer" means a buffer enabling a hybridization reaction to occur 
between the probes and the polynucleic acids present in the sample, or the amplified products, under the 
appropriate stringency conditions. 

The term "wash solution" means a solution enabling washing of the hybrids formed under the 
appropriate stringency conditions. 

The following examples only serve to illustrate the present invention. These examples are in no way 
intended to limit the scope of the present invention. 

FIGURE AND TABLE LEGENDS 

Figure 1 : Natural and drug selected variability in the vicinity of codons 30, 46, 48, 50, 54, 82, 
84, and 90 of the HIV-1 protease gene. The most frequently observed wild-type sequence is shown in 
the top line. Naturally occurring variations are indicated below and occur independently from each other. 
Drug-selected variants are indicated in bold 

Figure 2 A: Reactivities of the selected probes for codon 30 immobilized on LiPA strips with reference 
material. The information in the boxed surface is not relevant for the discussion of probes for condon 30 
The position of each selected probe on the membrane strip is shown at the left of each panel. The 
sequence of the relevant part of the selected probes is shown at the left and is given in Table 1. Each strip 
is incubated with a bio tiny lated PCR fragment from the reference panel. The reference panel accession 
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numbers are indicated in Table 1 . For several probes multiple reference panel possibilities are available, 
but only one relevant accession number given each time. *: False positive reactivities. At the bottom the 
strips, the amino acids at the relevant codon, as derived from the probe reactivity, is indicated. 

5 Figure 2 B: Reactivities of the selected probes for codons 46 and 48 immobilized on LiPA strips with 
reference material. The information in the boxed surface is not relevant for the discussion of probes for 
condons 46 and 48 The position of each selected probe on the membrane strip is shown at the left of each 
panel. The sequence of the relevant part of the selected probes is given in Table 1. Each strip is incubated 
with a biotinylated PCR fragment from the reference panel. The reference panel accession numbers are 
1 0 indicated in Table 1 . For several probes multiple reference panel possibilities are available, but only one 
relevant accession number given each time. *: False positive reactivities. On top of the strips, the amino 
acids at the relevant codon, as derived from the probe reactivity, is indicated. 

Figure 2 C: Reactivities of the selected probes for codon 50 immobilized on LiPA strips with reference 
*3 15 material. The information in the boxed surface is not relevant for the discussion of probes for condon 50. 



01 



The position of each selected probe on the membrane strip is shown at the left of each panel. The 
1 sequence of the relevant part of the selected probes is given in Table 1. Each strip is incubated with a 

biotinylated PCR fragment from the reference panel. The reference panel accession numbers are 
indicated in Table 1 . For several probes multiple reference panel possibilities are available, but only one 

i Sj 20 relevant accession number given each time. *: False positive reactivities. At the bottom of the strips, the 

y i 

Q amino acids at the relevant codon, as derived from the probe reactivity, is indicated. 

Figure 2 D: Reactivities of the selected probes for codon 54 immobilized on LiPA strips with reference 
material. The information in the boxed surface is not relevant for the discussion of probes for condon 54. 

25 The position of each selected probe on the membrane strip is shown at the left of each panel. The 
sequence of the relevant part of the selected probes is given in Table 1. Each strip is incubated with a 
biotinylated PCR fragment from the reference panel. The reference panel accession numbers are 
indicated in Table 1 . For several probes multiple reference panel possibilities are available, but only one 
relevant accession number given each time. *: False positive reactivities. At the bottom of the strips, the 

30 amino acids at the relevant codon, as derived from the probe reactivity, is indicated. 



Figure 2 E.:Reactivities of the selected probes for codons 82 and 84 immobilized on LiPA strips with 
reference material. The information in the boxed surface is not relevant for the discussion of probes for 
condons 82 and 84. The position of each selected probe on the membrane strip is shown at the left of 
35 each panel. The sequence of the relevant part of the selected probes is given in Table 1. Each strip is 
incubated with a biotinylated PCR fragment from the reference panel. The reference panel accession 
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numbers are indicated in Table 1 . For several probes multiple reference panel possibilities are available, 
but only one relevant accession number given each time. *: False positive reactivities. At the bottom of 
the strips, the amino acids at the relevant codon, as derived from the probe reactivity, is indicated. 

Figure 2 F: Reactivities of the selected probes for codon 90 immobilized on LiPA strips with reference 
material. The information in the boxed surface is not relevant for the discussion of probes for condon 90. 
The position of each selected probe on the membrane strip is shown at the left of each panel. The 
sequence of the relevant part of the selected probes is given in Table 1. Each strip is incubated with a 
biotinylated PCR fragment from the reference panel. The reference panel accession numbers are 
indicated in Table 1 . For several probes multiple reference panel possibilities are available, but only one 
relevant accession number given each time. *: False positive reactivities. At the bottom of the strips, the 
amino acids at the relevant codon, as derived from the probe reactivity, is indicated. 

Figure 3: Sequence and position of the HIV-1 protease amplification primers. To obtain the reactivity 
with probes selected to determine the susceptibility to antiviral drugs involving codons 30, 46, 48, 50, 
54, 82, and 84, nested amplification primers prot2bio(5' primer) and Prot31bio (3' primer) were designed. 
To obtain the reactivity with probes selected to determine the susceptibility to antiviral drugs involving 
codon 90, nested amplification primers Prot41bio (5' primer) and Prot6bio (3* primer) were designed. 

Figure 4 A: Phylogenetic analysis on 3 12 protease sequences allowed to separate genotype B strains from 
non-B strains. Reactivities of the selected probes for codon 30 immobilized on LiPA strips with a 
biotinylated PCR fragment of genotype B strains and non-B strains is shown, the exact percentages are 
indicated in table 5. The probes are indicated at the bottom. The sequence of the relevant part of the 
probes is given in Table 1 . 

Figure 4 B: Phylogenetic analysis on 312 protease sequences allowed to separate genotype B strains from 
non-B strains. Reactivities of the selected probes for codons 46/48 immobilized on LiPA strips with a 
biotinylated PCR fragment of genotype B strains and non-B strains is shown, the exact percentages are 
indicated in table 5. The probes are indicated at the bottom. The sequence of the relevant part of the 
probes is given in Table 1 . 

Figure 4 C: Phylogenetic analysis on 3 12 protease sequences allowed to separate genotype B strains from 
non-B strains. Reactivities of the selected probes for codon 50 immobilized on LiPA strips with a 
biotinylated PCR fragment of genotype B strains and non-B strains is shown, the exact percentages are 
indicated in table 5. The probes are indicated at the bottom. The sequence of the relevant part of the 
probes is given in Table 1 . 
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Figure 4 D: Phylogenetic analysis on 312 protease sequences allowed to separate genotype B strains from 
non-B strains. Reactivities of the selected probes for codon 54 immobilized on LiPA strips with a 
biotinylated PCR fragment of genotype B strains and non-B strains is shown, the exact percentages are 
indicated in table 5. The probes are indicated at the bottom. The sequence of the relevant part of the 
probes is given in Table 1 . 

Figure 4 E: Phylogenetic analysis on 312 protease sequences allowed to separate genotype B strains from 
non-B strains. Reactivities of the selected probes for codons 82/84 immobilized on LiPA strips with a 
biotinylated PCR fragment of genotype B strains and non-B strains is shown, the exact percentages are 
indicated in table 5. The probes are indicated at the bottom. The sequence of the relevant part of the 
probes is given in Table 1 . 

Figure 4 F: Phylogenetic analysis on 312 protease sequences allowed to separate genotype B strains from 
non-B strains. Reactivities of the selected probes for codon 90 immobilized on LiPA strips with a 
biotinylated PCR fragment of genotype B strains and non-B strains is shown, the exact percentages are 
indicated in table 5. The probes are indicated at the bottom. The sequence of the relevant part of the 
probes is given in Table 1. 

Figure 5 A: Geographical origin of 856 samples and reactivities with the different probes at codon 
position 30. The exact percentages are indicated in table 6. The probes are indicated at the bottom. 

Figure 5 B: Geographical origin of 856 samples and reactivities with the different probes at codon 
positions 46/48. The exact percentages are indicated in table 6. The probes are indicated at the bottom. 

Figure 5 C: Geographical origin of 856 samples and reactivities with the different probes at codon 
position 50. The exact percentages are indicated in table 6. The probes are indicated at the bottom. 

Figure 5 D: Geographical origin of 856 samples and reactivities with the different probes at codon 
position 54. The exact percentages are indicated in table 6. The probes are indicated at the bottom. 

Figure 5 E: Geographical origin of 856 samples and reactivities with the different probes at codon 
positions 82/84. The exact percentages are indicated in table 6. The probes are indicated at the bottom. 

Figure 5 F: Geographical origin of 856 samples and reactivities with the different probes at codon 
position 90. The exact percentages are indicated in table 6. The probes are indicated at the bottom. 
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Table 1: HIV-1 protease wild-type and drug-selected mutation probes with their corresponding 
sequences as applied on the HIV-1 protease LiPA strip. The most frequently observed wild-type 
sequence is shown at the top line. Probe names corresponding to the selected motifs are indicated in the 
left column, the relevant part of each probe applied on the strip is shown under the consensus sequence. 



Table 2: Protease Inhibitors. 

Table 3: HIV-1 protease wild-type and drug-selected mutation probes with their corresponding 
sequences as synthesized, immobilized and tested on LiPA slrips. The most frequently observed wild- 
type sequence is shown at the top line. Probe names corresponding to the selected motifs are indicated 
in the left column, the relevant part of each probe applied on the strip is shown under the consensus 
sequence. The probes retained are indicated in table 1. 

15 Table 4: Polymorphic nucleotide sequences. 

Table 5: % Reactivities of the HTV-1 protease wild-type and drug-selected mutation probes applied on 
the HTV-1 protease LiPA strip with genotype B strains and non-B strains. 



Table 6: % Reactivities of the HTV-1 protease wild-type and drug-selected mutation probes applied on 
the HIV-1 protease LiPA strip with samples of different geographical origin. 



Table 7: HTV-1 protease wild-type and drug-selected mutation probes with their corresponding sequences 
as applied on the HTV-1 protease LiPA strip. The most frequently observed wild-type sequence is shown 
25 at the top line. Probe names corresponding to the selected motifs are indicated in the left column, the 
relevant part of each probe applied on the strip is shown under the consensus sequence. 

EXAMPLES 
30 Example 1: 

Selection of the plasma samples. PCR amplifica tion and cloning of the PPR p rn,w*c 

Plasma samples (n=557) were taken from HTV type-1 infected patients and stored at -20°C until 
use. Plasma samples were obtained from naive and drug-treated patients. The drugs involved ritonavir, 
35 indinavir and saquinavir. The serum samples were collected from patients residing in Europe (Belgium! 
Luxembourg, France, Spain and UK), USA and Brazil. 
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HIVRNA was prepared from these samples using the guanidinium-phenol procedure. Fifty 
plasma was mixed with 150 ul Trizol®LS Reagent (Life Technologies, Gent, Belgium) at room 
temperature (volume ratio: lunit sample/ 3 units Trizol ). Lysis and denaturation occurred by carefully 
pipetting up and down several times, followed by an incubation step at room temperature for at least 5 
minutes. Fourthy ul CHC1 3 was added and the mixture was shaken vigorously by hand for at least 15 
seconds, and incubated for 15 minutes at room temperature. The samples were centrifuged at maximum 
12,000g for 15 minutes at 4°C, and the colorless aqueous phase was collected and mixed with 100 ul 
isopropanol. To visualize the minute amounts of viral RNA, 20 ul of lug/ul Dextran T500 (Pharmacia) 
was added, mixed and left at room temperature for 10 minutes. Following centrifugation at max. 12,000g 
for 10 minutes at 4°C and aspiration of the supernatant, the RNA pellet was washed with 200 ul ethanol, 
mixed by vortexing and collected by centrifugation at 7,500g for 5 minutes at 4°C. Finally the RNA 
pellet was briefly air-dried and stored at -20°C. Alternatively, the High Pure Viral Nucleic Acid Kit 
(Boehringer Mannheim ) was used to extract RNA from the samples 

For cDNA synthesis and PCR amplification the RNA pellet was dissolved in 15 ul random 
primers (20 ng/ul, pdN 6 , Pharmacia), prepared in DEPC-treated or HPLC grade water. After denaturation 
at 70°C for 10 minutes, 5 ul cDNA mix was added, composed of 4 ul 5x AMV-RT buffer (250mM 
Tris.HCl pH 8.5, lOOmM KC1, 30mM MgCl 2 , 25 mM DTT), 0.4 uL 25mM dXTPs, 0.2 ul or 25U 
Ribonuclease Inhibitor (HPRI, Amersham), and 0.3 ul or 8U AMV-RT (Stratagene). cDNA synthesis 
occurred during the 90 minutes incubation at 42°C. The HIV -1 protease gene was than amplified using 
the following reaction mixture: 5 ul cDNA, 4.5 ul lOx Taq buffer, 0.3 ul 25 mM dXTPs, 1 ul (10 pmol) 
of each PCR primer, 38 ul H z O, and 0.2 ul (1 U) Taq. . Alternatively, the Titon One Tube RT-PCR 
system (Boehringer Mannheim) was used to perform RT-PCR. 

Codon positions involving resistance to saquinavir, ritonavir, indinavir, nelfinavir and VX-478 have been 
described (Shinazi et at) and PCR amplification primers were chosen outside these regions. The primer 
design was based on HIV-1 published sequences (mainly genotype B clade) (Myers et al.) and located 
in regions that showed a high degree of nucleotide conservation between the different HTV-l clades. The 
final amplified region covered the HIV-1 protease gene from codon 9 to codon 99. The primers for 
amplification had the following sequence: outer sense primer Prl6: 5' bio- 
CAGAGCCAACAGCCCCACCAG3' (SEQ ID NO 1); nested sense primer Prot 2 bio: 5' CCT CAR ATC 
ACT CTT TGG CAA CG 3' (SEQ ID NO 3) ; nested antisense primer Prot 6 bio: 3' TAA TCR GGA 
TAA CTY TGA CAT GGT C 5 1 (SEQ ID NO 4); and outer antisense primer RT12: 5' 
bioATC AGGATGGAGTTCATAACCCATCCA3 ' (SEQ ID NO 2). Annealing occurred at 57°C, 
extension at 72°C and denaturation at 94°C. Each step of the cycle took 1 minute, the outer PCR 
contained 40 cycles, the nested round 35. Nested round PCR products were analyzed on agarose gel and 
only clearly visible amplification products were used in the LiPA procedure. Quantification of viral 
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RNA was obtained with the HIV Monitor™test (Roche, Brussels, Belgium). Later on, new sets of 
primers for amplification were selected. For the amplification of HIV protease codon 30-84: outer sense 
primer P rotl6: 5'-CAGAGCCAACAGCCCCACCAG-3' (SEQ ID NO 501), outer antisense primer 
protS: 5 ' -TTTTCTTCTGTCAATGGCCATTGTTT-3 ' (SEQ ID NO 502) were used. Annealing occurred 
at 50°C, extension at 68°C and denaturation at 94°C for 35 cycles for the outer PCR. For the nested PCR 
annealing occurred at 45°C, denaturation at 94°C and extension at 92°C with primers: nested sense 
primers prot2a-bio: 5 '-bio-CCTCAAATCACTCTTTGGCAACG-3 ' (SEQ ID NO 503)and prot2b-bio: 
5 ' -bio-CCTCAGATCACTCTTTGGCAACG-3 ' (SEQ ID NO 504), and nested antisense primer prot31- 
bio: 5 '-bio-AGTCAACAGATTTCTTCCAAT-3 ' (SEQ ID NO 6). For the amplification of HTV protease 
codon 90, the outer PCR was as specified for HTV protease codon 30-84. For the nested PCR, nested 
sense primer prot41-bio: 5 '-bio-CCTGTCAACATAATTGCAAG-3 ' (SEQ ID NO 505) and nested 
antisense primers prot6a: 5 '-bio-CTGGTACAGTTTCAATAGGGCTAAT-3 ' (SEQ ID NO 506), prot6b: 
5 ' -bio-CTGGTACAGTTTCAATAGG ACTAAT-3 ' (SEQ ID NO 507), prot6c: 5'-bio- 
CTGGTACAGTCTCAATAGGACTAAT-3 ' (SEQ ID NO 508), prot6d: 5'-bio- 
CTGGTACAGTCTCAATAGGGCTAAT-3 ' (SEQ ID NO 509) were used. For the nested PCR the 
annealing temperature occurred at 45°C. Primers were tested on a plasmid, which contained an HIV 
fragment of 1301 bp ligated in a pGEM-T vector. The fragment contains protease, reverse transcriptase 
and the primer sites of first and second round PCR. By restriction with Sac /the plasmid is linearised. 

Selected PCR products were cloned into the pretreated EcoRV site of the pGEMT vector 
(Promega). Recombinant clones were selected after a-complementation and restriction fragment length 
analysis, and sequenced using standard sequencing techniques with plasmid primers and internal HIV 
protease primers. Sometimes biotinylated fragments were directly sequenced with a dye-terminator 
protocol (Applied Biosystems) using the amplification primers. Alternatively, nested PCR was carried 
out with analogs of the nested primers, in which the biotin group was replaced with the T7- and SP6- 
primer sequence, respectively. These amplicons were than sequenced with an SP6- and T7-dye-primer 
procedure. 



Example 2 : 

Selection of a reference panel 

Codon positions involving resistance to saquinavir, ritonavir, indinavir, nelfinavir and VX-478 
have been described (Shinazi et al. 1997). It was the aim to clone in plasmids those viral protease genes 
that are covering the different genetic motifs at those important codon positions conferring resistance 
against the described protease inhibitors. 

After careful analysis of 312 protease gene sequences, obtained after direct sequencing of PCR 
fragments, a selection of 47 PCR fragments which covered the different target polymorphisms and 
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mutations were retained and cloned in plasmids using described cloning techniques. The selection of 
samples originated from naive or drug-treated European, Brazilian or US patients. These 47 
recombinant plasmids are used as a reference panel, a panel that was sequenced on both strands, and 
biotinylated PCR products from this panel were used to optimize probes for specificity and sensitivity. 

Although this panel of 47 samples is a representative selection of clones at this moment, it is 
important to mention here that this selection is an fact only a temporally picture of the variability of the 
virus, and a continuous update of this panel will be mandatory. This includes on ongoing screening for 
the new variants of the virus, and recombinant cloning of these new motifs. 

Probe selection and LiPA testing . 

To cover all the different genetic motifs in the reference panel, a total of 471 probes were 
designed (codon 30: 40 probes; codon 46/48: 72 probes; codon 50:55 probes; codon 54: 54 probes; codon 
82/84: 130 probes; codon 90: 120 probes).Table 3 shows the different probes that were selected for the 
different codon positions. 

It was the aim to adapt all probes to react specifically under the same hybridization and wash 
conditions by carefully considering the % (G+C), the probe length, the final concentration of the buffer 
components, and hybridization temperature (Stuyver et al, 1997). Therefore, probes were provided 
enzymatically with a poly-T-tail using the TdT (Pharmacia) in a standard reaction condition, and purified 
via precipitation. For a limited number of probes with 3' T-ending sequences, an additional G was 
incorporated between the probe sequence and the poly-T-tail in order to limit the hybridizing part to the 
specific probe sequence and to exclude hybridization with the tail sequence. Probe pellets were dissolved 
in standard saline citrate (SSC) buffer and applied as horizontal parallel lines on a membrane strip. 
Control lines for amplification (probe 5' TAGGGGGAATTGGAGGTTTTAG 3 \ HIV protease aa 47 
to aa 54) and conjugate incubation (biotinylated DNA) were applied alongside. Probes were 
immobilized onto membranes by baking, and the membranes were sliced into 4mm strips also called 
LiPA strips. 

Selection of the amplification primers and PCR amplification was as described in example 1. 
In order to select specific reacting probes out of the 471 candidate probes, LiPA tests were performed 
with biotinylated PCR fragments from the reference panel. To perform LiPA tests, equal amounts (10 
Hi) of biotinylated amplification products and denaturation mixture (0.4 N NaOH/0.1% SDS) were 
mixed, followed by an incubation at room temperature for 5 minutes. Following this denaturation step, 
2 ml hybridization buffer (2xSSC, 0.1% SDS, 50mM Tris pH7.5) was added together with a membrane 
strip and hybridization was carried out at 39°C for 30 min. Then, the hybridization mixture was replaced 
by stringent washing buffer (same composition as hybridization buffer), and stringent washing occurred 
first at room temperature for 5 minutes and than at 39°C for another 25 minutes. Buffers were than 
replaced to be suitable for the streptavidine alkaline phosphatase conjugate incubations. After 30 minutes 
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incubation at room temperature, conjugate was rinsed away and replaced by the substrate components 
for alkaline phosphatase, Nitro-Blue-Tetrazolium and 5-Bromo-4-Chloro-3-Indolyl Phosphate. After 30 
minutes incubation at room temperature, probes where hybridization occurred became visible because 
of the purple brown precipitate at these positions. 

After careful analysis of the 471 probes, the most specific and sensitive probes (n=46) were 
finally selected, covering the natural and drug-selected variability in the vicinity of aa. 30, 46, 48, 50, 
54, 82, 84, and 90. Figure 2 shows the reactivity of the finally selected probes with the reference panel. 

Example 3 : 

LiPA testing on clinical samples . 

A total of 856 samples were tested on this selection of 46 specific probes. The geographical 
origin of these samples is as follows: USA:359 ; France: 154; UK:36; Brazil 58; Spain 35; Belgium 199; 
Luxembourg: 15. 

From this population, a total of 144 samples were sequenced which allowed to separate the 
genotype B samples (94) from the non-B samples (50). After analysis of these genotyped samples on 
LiPA, the genotypic reactivity on the selected probes was scored. Figures 4A to 4F show these results 
for tiie different codon positions and for the genotype B versus non-B group. From these tables, it is clear 
that there is little difference in sequence usage for the different codon positions with respect to specific 
reactivities at the different probes. 

The total collection of 856 samples was then tested on the available 46 probes. After dissection 
of these reactivities over the different probes and different geographical origin, the picture looks as is 
presented in Figures 5A to 5F. Again here, the majority of the sequences used at the different codon 
positions are restricted to some very abundant wild type motifs. It is important to mention here that the 
majority of these samples are taken from patients never treated with protease inhibitors, en therefore, the 
majority of the reactivities are found in wild type motifs. Nevertheless, it is clear from some codon 
positions that the variability at some codon positions in the mutant motif might be considerable, and 
again, a continuos update on heavily treated patients is mandatory. Another issue is the amount of double 
blank reactivities, which is in this approach reaching up to 5% in global; with some peak values for some 
countries for some codon positions: for example 13.8% for codon 82/85 in Brazil; and 18.1 % for codon 
90 in Belgium. 

The continuous update resulted in a further selection of probes. This later configuration of the 
strip is indicated in table 7. 
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Table 1 



26 


27 


28 


29 


30 


31 


32 


33 


ACA 


GGA 


GCA 


GAT 


GAT 


ACA 


GTA 


TTA 






GCA 


GAT 


GAT 


ACA 


GT 






A 


GCG 


GAT 


GAT 


ACA 










GCA 


GAT 


GAC 


ACA 


GT 








GCA 


GAC 


GAT 


ACA 


GG 






A 


GCA 


GAT 


AAT 


ACA 


GT 




44 


45 


46 


47 


48 


49 


50 


51 


CCA 


AAA 


ATG 


ATA 


GGG 


GGA 


ATT 


GGA 




AAA 


ATG 


ATA 


GGG 


GGA 








A 


ATG 


ATA 


GGA 


GGA 


ATT 




A 


AAA 


ATA 


ATA 


GGG 


GGA 










ATG 


ATA 


GTG 


GGA 


ATT 





34 3 Tm lengte Seq ID 



pc30w25 
pc30w29 
pc30w32 
pc30w36 
pc30m23 



pc48w47 
pc48w45 
pc48w72 
pc4 8m41 

48 49 50 51 52 53 54 

GGG GGA ATT GGA GGT TTT ATC 

pc50w31 GGA ATT GGA GGT TTT 

pc50w4 4 GGA ATT GGG GGT TTG 

pc50w52 GA ATT GGA GGC TTG 

pc50m37 GGG GGA GTT GGA 



52 





51 


52 


53 


54 


55 


56 


57 


58 


pc54w3 


GGA 


GGT 


TTT 


ATC 


AAA 


GTA 


AGA 


CAG 




GT 


TTT 


ATC 


AAA 


GTA 


AGA 


pc54w34 


GA 


GGT 


TTT 


ATC 


AAA 


GT 






pc54wl4 




GGT 


TTT 


ATC 


AAG 


GTA 


A 




pc54wl9 


A 


GGC 


TTT 


ATC 


AAA 


GTA 






pc54w22 


GA 


GGT 


TTT 


ATT 


AAA 


GTA 






pc54w26 


A 


GGT 


TTC 


ATT 


AAG 


GTA 






pc54w27 




GGT 


TTT 


ATT 


AAG 


GTA 


A 




pc54m55 


A 


GGT 


TTT 


GCC 


AAA 


GT 






pc54m35 




GGT 


TTT 


GTC 


AAA 


GTA 






pc54m37 




GGT 


TTT 


GTC 


AGA 


GTA 







78 79 80 81 82 83 84 85 86 87 

rt , GGA CCT ACA CCT GTC ATA ATT GGA AGA 

pc82w91 ACA CCT GTC AAC ATA A 
pc82w60 CA CCT GTC AAT ATA ATG 

pc82wlll A CCG GTC AAC ATA ATT 

pc82w8 9 ACA CCT GTT AAC ATA AG 

pc82w42 CA CCT GTC AAC GTA 

pc82m3 6 ACA CCT ACC AAC ATA 

pc82m67 ACA CCT ACC AAC GT 

pc82m38 ACA CCT TTC AAC ATA 

pc82ml05 ACG CCC TTC AAC ATA 

pc82ml27 CA CCT TTC AAC GTA ATG 

pc82m4 0 ACA CCT GCC AAC ATA 

pc82m63 CA CCT GCC AAT ATA AG 

pc 8 2ml 01 ACA CCT ATC AAC ATA ATG 



4 0 


1 A 

J. H 


J J. 


36 


1 "3 
X o 


o D 


42 


14 


38 


40 


14 


42 


40 


15 


29 


4 7 


1 R 

± D 


y o 


42 


1 £ 


Q 1 

y i 


42 




ion 


40 






42 


15 


151 


42 


15 


164 




14 


172 


40 


12 


157 


42 


1 7 


I/O 


42 


1 

-L Q 


919 


42 


1 

-L D 


i on 

x o y 


42 




1 Q A 

i y h 


4 7 


1 1 


1 Q *7 

i y / 


4 7 


1 £ 


i n o 
ZUz 


4 n 


1 £ 


Z U 4 


.»> o 


1 R 
X 3 




4 0 


1 R 
± D 




4 7 


1 R 


1 D 


44 


16 


318 


42 


17 


287 


44 


16 


338 


42 


17 


316 


42 


14 


269 


42 


15 


263 


42 


14 


294 


40 


15 


265 


44 


15 


332 


44 


17 


354 


44 


15 


267 


42 


16 


290 


44 


18 


328 
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Table 1 - Cont'd 



86 87 88 89 90 

GGA AGA AAT CTG TTG 

pc90w27 AAT CTG TTG 

pc90w37 AAT CTG TTG 

pc90w39 GA ACT CTG TTG 

pc90w50 AAT ATG TTG 

pc90w52 AAT TTG TTG 

pc90w69 GA AAC CTG TTG 

pc90w73 TG TTG 

pc90w79 TG TTG 

pc90m4 3 A AAT CTG ATG 

pc90m5 6 AAT ATG ATG 



91 


92 


93 


94 








ACT 


CAG 


ATT 


GGT 








ACT 


CA 






38 


14 


384 


ACT 


CAG 


ATG 




42 


18 


394 


ACT 


C 






44 


15 


396 


ACT 


CAG 






40 


15 


407 


ACT 


CAG 






40 


15 


409 


ACT 








40 


14 


426 


ACA 


CAG 


CTT 


G 


44 


15 


430 


ACC 


CAG 


ATT 


G 


44 


15 


436 


ACT 


CA 






40 


15 


400 


ACC 


CAG 






42 


15 


413 
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Table 2 
Protease Inhibitors 



Compound 
Protease Inhibitors 
A-77003 



P 



01 



P9941 

Ro 31-8959 
(saquinavir) 



Amino acid change 



Codon change 



R8Q 


CGA to CAA 


R8K 


CGA to AAA 


V32I 


GTA to ATA 


M4ol 


ATG to ATA 


M46L 


ATG to TTC 


JVl4or 


ATG to TTC 


M46V 


ATG to GTG 




GGG to GTG 


A7I V 


GCT to GTT 


V82I 


GTC to ATC 


V82A 


GTC to GCC 


T AIT) 


CTC to CCC 


A71 T 


GCT to ACT 


A71V 


GCT to GTT 


G73S 


GGT to GCT 


V82A 


GTC to GCC 


V82F 


GTC to TTC 


V82T 


GTC to ACC 


I84V 


ATA to GTA 


L90M 


TTG to ATG 


V82A 


GTC to GCC 


L10I 


CTC to ATC 


G48V 


GGG to GTG 



I54V 

I54V 

G73S 

V82A 

I84V 

L90M 



ATC to GTC 
ATA to GTA 
GGT to AGT 
GTC to GCC 
ATA to GTA 
TTG to ATG 



RPI-312 



I84V 



ATA to GTA 
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Table 2 - 



Cont'd-l 



SC-52151 



SC-55389A 
SKF1 08842 
SKF1 08922 
VB 11,328 



VX-478 
(141W94) 



XM323 



L24V 

G48V 

A71V 

V75I 

P81T 

V82A 

N88D 

L10F 
N88S 

V82T 
I84V 

V82A 
V82T 

L10F 
M46I 
I47V 
I50V 
184V 

L10F 
M46I 
I47V 
I50V 
I84V 

LI OF 
K45I 
M46L 
V82A 

V82I 
V82F 
I84V 



TTA to GTA 
GGG to GTG 
GCT to GTT 
GTA to ATA 
CCT to ACT 
GTC to GCC 
AAT to GAT 

CTC to CGC 
AAT to AGT 

GTC to ACC 
ATA to GTA 

GTC to GCC 
GTC to ACC 

CTC to GGC 
ATG to ATA 
ATA to CTA 
ATT to GTT 
ATA to GTA 

CTC to CGC 
ATG to ATA 
ATA to CTA 
ATT to GTT 
ATA to GTA 

CTC to CGC 
AAA to ATA 
ATG to CTG 
GTC to GCC 

GTC to ATC 
GTC to TTC 
ATA to GTA 



A-75925 

ABT-538 

(ritonavir) 



L97V 

I82T 

V32I 

K20R 

L33F 



TTA to GTA 

ATC to ACC 

GTA to ATA 
AAGto AAA 
TTA to TTC 
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Table 2 - Cont'd-2 



PCT/EP99/04317 



AG1343 
(nelfinavir) 



BILA 1906 
BS 



BILA 2011 
(palinavir) 



BILA 2 185 BS 
BMS 186,318 
DMP450 



M36I 


ATG to ATA 


M4ol 


ATG to ATA 


I54L 


ATC to ? 


I54V 


ATC to GTC 


A71V 


GTC to GTT 


V82F 


GTC to TTC 


V82A 


GTC to GCC 


V82T 


GTC to ACC 


V82S 


GTC to TCC 


I84V 


ATA to GTA 


L90M 


TTG to ATG 



D30N 

M36I 

M46I 

L63P 

A71V 

V771 

184V 

N88D 

L90M 

V32I 
M46I 
M46L 

A71V 

I84A 

184V 

V32I 
A71V 
I84A 
L63P 

L23I 



A71T 
V82A 

L10F 



GAT to AAT 

ATG to ATA 
CTC to CCC 
GCT to GTT 

ATA to GTA 

TTG to ATG 

GTA to ATA 
ATG to ATA 
ATG to TTG 

GCT to GTT 
ATA to GCA 
ATA to GTA 

GTA to ATA 
GCT to GTT 
ATG to ATA 
CTC to CCC 

CTA to ATA 



GCT to ACT 
GTC to GCC 

CTC to TTC 
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M46I 
D60E 
I84V 



Table 2 - Cont'd-3 



ATG to ATA 
GAT to GAA 
ATA to GTA 



KNI-272 



V32I 



GTA to ATA 



MK-639 

(L-735,524, 

indinavir) 



L10I 

L10R 

L10V 

K20M 

K20R 

L24I 

V32I 

M46I 

M46L 

I54V 



CTC to ATC 
CTC to CGC 
CTC to GTC 
AAG to ATG 
AAGto AAA 
TTA to ATA 
GTA to ATA 
ATG to ATA 
ATG to TTG 
ATC to GTC 
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Table 3 



O 



P30wl 

P30w2 

P30w3 

P30w4 

P30w5 

P30w6 

P30w7 

P30w8 

P30w9 

P30wl0 

P30mll 

P30ml2 

P30ml3 

P30ml4 

P30ml5 

P30ml5 

P30ml7 

P30ml8 

P30ml9 

P30m20 

p30w21 

p30w22 

p30m23 

p30m24 

p30w25 

p30w26 

p30w27 

p30w28 

p30w29 

p30m30 

p30m31 

p30w32 

p30w33 

p30w34 

p30w35 

p30w36 

p30w37 

p30w38 

p30w39 

p30w40 



26 
ACA 



ACA 
CA 
A 

ACA 



ACA 
CA 
A 

ACA 



27 
GGA 
A 
GA 
A 
GGA 
GGA 
GGA 
GGA 
GGA 
GGA 
GGA 
A 
GA 
A 
GGA 
GGA 
GGA 
GGA 
GGA 
GGA 
GGA 
A 
A 
A 
A 



GA 
A 



28 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

GCA 

CA 
GCG 
GCG 
GCA 
GCA 
GCA 

CA 

CA 
GCA 
GCA 
GCA 
A 
A 
GCA 



29 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAC 

GAC 

GAT 

GAT 

GAT 



30 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

AAT 

AAT 

AAT 

AAT 

AAT 

AAT 

AAT 

AAT 

AAT 

AAT 

GAT 

GAT 

AAT 

AAT 

GAT 

GAT 

GAT 

GAT 

GAT 

AAT 

AAT 

GAC 

GAC 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 

GAT 



31 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 

ACA 



32 33 34 35 

GTA TTA GAA GAA 

GTA TT- 

GTA TT 

GTA TTA 

GTA TT 

GTA TTA 

GT 

GTA TG 
GTA TG 
GG 

GTA TT 
GTA TT 
GTA TTA 
GTA TT 
GTA TTA 

GT 

GTA TG 
GTA TG 
GG 
GT 

GTA G 

GTA 

GTA G 

GT 

GG 

GT 



GTA 

GT 

GT 

GTA G 
ATA TT 
ATA TG 
GG 
GT 

ATA TT 
ATA TTA 
ATA 



length Seq ID 



1 p 
X o 


-7 

/ 


1 Q 
X y 


o 
o 


1 Q 

x y 


y 


9 n 
z u 


1U 


9 1 
z X 


X X 


1 Q 
X o 


1 o 

Iz 


x y 


13 


9 Ci 

2 U 


14 


x y 


15 


x y 


16 


1 o 


17 


l y 


18 


l y 


19 


Z (J 


20 


9 1 

Z X 


21 


X O 


9 9 

ZZ 


1 Q 

x y 


9 9 

23 


9 A 
Z U 


9 A 

z 4 


x y 


25 


i y 


2 6 


Xo 


9 ~7 

21 


X D 


28 


X o 




X D 


3U 


1 A 
X H 


31 


14 


32 


13 


33 


14 


34 


13 


35 


15 


36 


14 


37 


14 


38 


14 


39 


16 


40 


16 


41 


13 


42 


14 


43 


15 


44 


16 


45 


15 


46 
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Table 3 - Cont'd-l 



44 45 46 
CCA AAA ATG 



P48wl 






P4 8w2 






P48w3 






P4 8w4 






P48w5 




G 


P48w6 




ATG 


P48w7 




ATG 


P48w8 


A 


ATG 


P48w9 


A 


ATG 


P48wl0 


A 


ATG 


P48w21 




ATA 


P48w22 




ATG 


P48w23 


A 


ATA 


P48w24 


A 


ATG 


P48w25 






P48w26 






P48w28 






P48w29 






P48mll 






P48ml2 






P48ml3 






P48ml4 






P48ml5 




G 


P48ml6 




ATG 


P48ml7 




ATG 


P48ml8 


A 


ATG 


P48ml9 


A 


ATG 


P4 8m20 


A 


ATG 


P48m29 






P48m30 






P48m31 




ATG 


P4 8m32 




ATG 


P48m33 


A 


ATG 


p48w34 




G 


p48w35 




TG 


p48w36 




TG 


p48w37 




ATG 


p48m38 




G 


p48m39 




TG 


p48m40 




TG 


p48m41 




ATG 


p48w42 




ATA 


p48w43 




TG 


p48w44 




G 


p48w45 


A 


ATG 


p48w46 




ATG 


p48w47 


AAA 


ATG 


p48w48 


A AAA 


ATG 



47 48 49 50 51 52 
ATA GGG GGA ATT GGA GGT 
GTA GGG GGA ATT GGA GGT 
GTA GGG GGA ATT GGA GGT 
GTA GGG GGA ATT GGA GGT 
GTA GGG GGA ATT GGA GGT 
GTA GGG GGA ATT GGA GGT 
GTA GGG GGA ATT GGA 
GTA GGG GGA ATT GGA G 
GTA GGG GGA ATT GGA 
GTA GGG GGA ATT GGA G 
GTA GGG GGA ATT GGA GGG 
ATA GGG GGA ATT GGA 
ATA GGG GGA ATT GGA 
ATA GGG GGA ATT GGA 
ATA GGG GGA ATT GGA 
ATA GGG GGA ATT GGA GGT 
ATA GGG GGA ATT GGA GGT 
ATA GGG GGA ATT GGA GGT 
ATA GGG GGA ATT GGA GGT 
GTA GTG GGA ATT GGA GGT 
GTA GTG GGA ATT GGA GGT 
GTA GTG GGA ATT GGA GGT 
GTA GTG GGA ATT GGA GGT 
GTA GTG GGA ATT GGA GGT 
GTA GTG GGA ATT GGA 
GTA GTG GGA ATT GGA G 
GTA GTG GGA ATT GGA 
GTA GTG GGA ATT GGA G 
GTA GTG GGA ATT GGA GGG 

ATA GTG GGA ATT GGA GGT 

ATA GTG GGA ATT GGA GGT 

ATA GTG GGA ATT GGA 

ATA GTG GGA ATT GGA G 

ATA GTG GGA ATT GGA 

ATA GGG GGA ATT G 

ATA GGG GGA ATT G 

ATA GGG GGA ATT GG 

ATA GGG GGA ATT 

ATA GTG GGA ATT G 

ATA GTG GGA ATT G 

ATA GTG GGA ATT GG 

ATA GTG GGA ATT 

ATA GGG GGA ATT 

ATA GGG GGA GTT 

ATA GGG GGA GTT G 

ATA GGA GGA ATT 

ATA GGG GGA ATT 

ATA GGG GGA 

ATA GGG GG 



53 54 length Seq ID 
TTT ATC 



GG 


1 ft 

X O 


A 7 
H f 


TG 


1 Q 

X _? 


A Q 
H O 






A Q 


TTT 

-L _L J. 


9 1 
z X 


n 

D U 


TTG 


z. X 


D X 




1 P 






1 Q 

X y 






1 Q 
X z? 


C: A 




9 n 


JJ 


GG 


9 9 


D o 




1 P 

X o 


K 7 




1 p 
X o 


Ob 




1 Q 

x y 


Q 

o y 




1 Q 

X _3 


bu 


GG 


1 ft 

X o 


D X 


TG 




£9 
DZ 


TTG 


9 n 


£ "3 
DJ 


TTT 


9 1 

<L X 


&A 


GG 


1 ft 

X o 


DO 


TG 


1 Q 

X _7 


D D 


TTG 


9 n 


£7 


TTT 


9 1 

Z. X 


£P 


TTG 


91 
z. X 






1 ft 
X o 


/ u 




X _? 


7 1 
/ X 




X Z> 


7 9 
/ Z 




9 n 


7 "5 


GG 


9 9 


7 A 


GG 


1 ft 

X o 


7 c: 




1 Q 

X z? 


7 a 




1 p 
X o 


"7 ~7 

I 1 




1 Q 

x y 


/ O 




1 o 
x y 


"7 O 

/ y 




14 


80 




15 


81 




16 


82 




15 


83 




14 


84 




15 


85 




16 


86 




15 


87 




15 


88 




14 


89 




14 


90 




16 


91 




15 


92 




15 


93 




15 


94 



WO 99/67428 




pn ow4 y 




AA 


ATG 


p4oWo U 




AAA 


ATA 


p4 OWOl 




AAA 


ATA 


w /I O™ c o 

p4ombz 




AAA 


ATG 


/I On c ou 
p4t oWOZC 




AAA 


TTG 


pn OlTlOO 




AAA 


ATG 


p4 oWj JC 




AAA 


TTG 


p4 o wo4 


CA 


AAA 


TTG 


p4 OWO O 






ATG 


p4 OWO D 




AA 


ATG 


pft O WO / 


A 


AAA 


ATG 


p^t o Wo o 






ATG 


p H. OWO -7 








p48w60 








p48w61 






ATG 


p48w62 








p48w63 








p48m64 




AAA 


ATA 


p48m65 


A 


AAA 


ATA 


p48m66 


CA 


AAA 


ATA 


p48m67 




AAA 


TTG 


p48m68 


A 


AAA 


TTG 


p4 8m69 


CA 


AAA 


TTG 


p48w70 




AAA 


ATG 


p48w71 


A 


AAA 


ATG 


pc48w72 


A 


AAA 


ATA 
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ATA 


GGG 


GGA 


AG 




ATA 


GGG 


GGA 


AG 




AAA 


AT 








ATA 


GTG 


GGA 


AG 




ATA 


GGG 


GG 






ATA 


GTG 


GGA 






ATA 


GGG 


GGA 






ATA 


G 








GTA 


GGG 


GGA 


ATT 




GTA 


GGG 


GGA 






GTA 


GGG 


G 






ATA 


GGG 


GAA 


ATT 




ATA 


GGG 


GAA 


ATT 


GGA 


ATA 


GGG 


GAA 


ATT 


GGA G 


ATA 


GGG 


GGG 


ATT 




ATA 


GGG 


GGG 


ATT 


GG 


A 


GGG 


GGG 


ATT 


GGA 


ATA 


GTG 


GGA 






ATA 


GTG 


GGA 






ATA 


GTG 


GG 






ATA 


GTG 


GGA 






ATA 


GTG 


GGA 






ATA 


GTG 


G 






ATA 


GGG 


GG 






ATA 


GGG 


G 






ATA 


GGG 


GGA 
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15 


95 


16 


96 


15 


97 


16 


98 


14 


99 


15 


100 


15 


101 


15 


102 


15 


103 


14 


104 


14 


105 


15 


106 


15 


107 


16 


108 


15 


109 


14 


110 


13 


111 


15 


112 


16 


113 


16 


114 


15 


115 


16 


116 


15 


117 


14 


118 


14 


119 


16 


120 
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Table 3 - Cont'd-3 





45 


46 


47 


48 


49 


50 


51 


52 


53 


54 


length Seq ID 




AAA 


ATG 


GTA 


GGG 


GGA 


ATT 


GGA 


GGT 


TTT 


ATC 






P50wl 








GGG 


GGA 


ATT 


GGA 


GGT 


TTT 




18 


121 


P50w2 






A 


GGG 


GGA 


ATT 


GGA 


GGT 


TTT 




19 


122 


P50w3 






TA 


GGG 


GGA 


ATT 


GGA 


GGT 


TTT 




20 


123 


P50w4 






A 


GGG 


GGA 


ATT 


GGA 


GGT 


TTT 


AG 


20 


124 


P50w5 






TA 


GGG 


GGA 


ATT 


GGA 


GGT 


TTT 


AG 


21 


125 


P50w6 






GTA 


GGG 


GGA 


ATT 


GGA 


GGT 


TGG 




19 


126 


P50w7 




G 


GTA 


GGG 


GGA 


ATT 


GGA 


GGT 


TGG 




20 


127 


P50w8 






GTA 


GGG 


GGA 


ATT 


GGA 


GGT 


TTG 




20 


128 


P50w9 






GTA 


GGG 


GGA 


ATT 


GGA 


GGT 


TTT 




20 


129 


P50wl0 




TG 


GTA 


GGG 


GGA 


ATT 


GGA 


GGT 


GG 




20 


130 


p50w21 








GG 


GGA 


ATT 


GGA 


GGT 


TTT 




17 


131 


P50w22 








GG 


GGA 


ATT 


GGA 


GGT 


TTG 




16 


132 


P50w23 








GG 


GGA 


ATT 


GGA 


GGT 


TTT 


AG 


18 


133 


P50w24 








GG 


GGA 


ATT 


GGA 


GGT 


TG 




15 


134 


P50w25 








G 


GGA 


ATT 


GGA 


GGT 


TTT 


AT 


18 


135 


P50w26 








GG 


GGA 


ATT 


GGA 


GGT 


TTT 




17 


136 


P50mll 








GGG 


GGA 


GTT 


GGA 


GGT 


TTT 




18 


137 


P50ml2 






A 


GGG 


GGA 


GTT 


GGA 


GGT 


TTT 




19 


138 


P50ml3 






TA 


GGG 


GGA 


GTT 


GGA 


GGT 


TTT 




20 


139 


P50ml4 






A 


GGG 


GGA 


GTT 


GGA 


GGT 


TTT 


AG 


20 


140 


P50ml5 






TA 


GGG 


GGA 


GTT 


GGA 


GGT 


TTT 


AG 


21 


141 


P50ml6 






GTA 


GGG 


GGA 


GTT 


GGA 


GGT 


TGG 




19 


142 


P50ml7 




G 


GTA 


GGG 


GGA 


GTT 


GGA 


GGT 


TGG 




20 


143 


P50ml8 






GTA 


GGG 


GGA 


GTT 


GGA 


GGT 


TTG 




20 


144 


P50ml9 






GTA 


GGG 


GGA 


GTT 


GGA 


GGT 


TTT 


ATC 


21 


145 


P50m20 




TG 


GTA 


GGG 


GGA 


GTT 


GGA 


GGT 


GG 




20 


146 


P50m27 








GG 


GGA 


GTT 


GGA 


GGT 


TTG 




19 


147 


P50m28 








GG 


GGA 


GTT 


GGA 


GGT 


TTT 


AG 


18 


148 


P50m2 9 








GG 


GGA 


GTT 


GGA 


GGT 


TG 




15 


149 


P50m30 








G 


GGA 


GTT 


GGA 


GGT 


TTT 


AT 


18 


150 


p50w31 










GGA 


ATT 


GGA 


GGT 


TTT 




15 


151 


p50w32 








G 


GGA 


ATT 


GGA 


GGT 


TGG 




15 


152 


p50m33 










GGA 


GTT 


GGA 


GGT 


TTT 




15 


153 


p50m34 








G 


GGA 


GTT 


GGA 


GGT 


TGG 




14 


154 


p50m35 








GGG 


GGA 


GTT 


GGA 


G 






13 


155 


p50m36 








GG 


GGA 


GTT 


GGA 


G 






12 


156 


p50m37 








GGG 


GGA 


GTT 


GGA 








12 


157 


p50w38 










GGA 


ATT 


GGG 


GGT 


TTG 




14 


158 


p50w39 










GA 


ATT 


GGG 


GGT 


TTT 




14 


159 



WO 99/67428 



PCT/EP99/04317 



Table 3 - Cont'd-4 



p50w40 


GA 


ATT 


GGG 


GGT 


TTT AG 


15 


160 


p50w41 


GGA 


ATT 


GGG 


GGT 


TG 


13 


161 


p50w42 


GGA 


ATT 


GGG 


GGT 


G 


12 


162 


p50w43 


GA 


ATT 


GGG 


GGT 


TG 


12 


163 


p50w44 


GA 


ATT 


GGG 


GGT 


TTG 


13 


164 


p50w45 


GGG GGA 


ATT 


GCA 


G 




13 


165 


p50w46 


GGA 


ATT 


GCA 


GGT 


TG 


14 


166 


p50w47 


GGA 


ATT 


GCA 


GGT 


G 


13 


167 


p50w48 


GGA 


ATT 


GGA 


GGG 


TTG 


14 


168 


p50w49 


GA 


ATT 


GGA 


GGG 


TTG 


13 


169 


p50w50 


GA 


ATT 


GGA 


GGG 


TTT 


14 


170 


p50w51 


GGA 


ATT 


GGA 


GGC 


TTG 


14 


171 


p50w52 


GA 


ATT 


GGA 


GGC 


TTG 


13 


172 


p50w53 


GA 


ATT 


GGA 


GGC 


TTT 


14 


173 


p50m54 


GGA 


GTT 


GGA 


GGT 


TTG 


15 


174 


p50m55 


GA 


GTT 


GGA 


GGT 


TTT 


14 


175 



WO 99/67428 
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51 


52 


53 


54 


55 


56 


57 58 


length 


Seq ID 




GGA 


GGT 


TTT 


ATC 


AAA 


GTA 


AGA CAG 






p54wl 




GGT 


TTT 


ATC 


AAA 


GTA 


A 


16 


176 


p54w2 




GT 


TTT 


ATC 


AAA 


GTA 


AG 


16 


177 


p54w3 




GT 


TTT 


ATC 


AAA 


GTA 


AGA 


17 


178 


p54w4 




T 


TTT 


ATC 


AAA 


GTA 


AGA 


16 


179 


p54w5 




GGT 


TTT 


ATC 


AAA 


GTA 




15 


180 


p54w6 




GT 


TTT 


ATC 


AAA 


GTA 




15 


181 


p54m7 




GGT 


TTT 


GCC 


AAA 


GTA 




15 


182 


p54m8 




GT 


TTT 


GCC 


AAA 


GTA 


A 


15 


183 


p54m9 




GT 


TTT 


GCC 


AAA 


GTA 


AG 


16 


184 


p54ml0 




T 


TTT 


GCC 


AAA 


GTA 


AGA 


16 


185 


p54mll 




GGT 


TTT 


GCC 


AAA 


GT 




14 


186 


p54ml2 




GT 


TTT 


GCC 


AAA 


GTA 




14 


187 


p54wl3 




GT 


TTT 


ATC 


AAG 


GTA 


AA 


16 


188 


p54wl4 




GGT 


TTT 


ATC 


AAG 


GTA 


A 


16 


189 


p54wl5 


A 


GGT 


TTT 


ATC 


AAG 


GTA 




16 


190 


p54wl6 




GT 


TTT 


ATC 


AAA 


GTC 


AGA 


17 


191 


p54wl7 






TTT 


ATC 


AAA 


GTC 


AGA C 


16 


192 


p54wl8 


A 


GGC 


TTT 


ATC 


AAA 


GTA 


A 


17 


193 


p54wl9 


A 


GGC 


TTT 


ATC 


AAA 


GTA 




16 


194 


p54m20 


A 


GGT 


TTT 


ATT 


AAA 


GTA 


A 


17 


195 


p54m21 




GGT 


TTT 


ATT 


AAA 


GTA 


AG 


17 


196 


p54w22 


GA 


GGT 


TTT 


ATT 


AAA 


GTA 




17 


197 


p54m22 


GA 


GGT 


TTT 


ATT 


AAA 


GTA 




17 


198 


p54m23 




GGT 


TTT 


ATT 


GGT 


TTT 


AT 


16 


199 


p54m24 




GGT 


TTC 


ATT 


AAG 


GTA 




15 


200 


p54m25 




GGT 


TTC 


ATT 


AAG 


GTA 


A 


16 


201 


p54w26 


A 


GGT 


TTC 


ATT 


AAG 


GTA 




16 


202 


p5 4m2 6 


A 


GGT 


TTC 


ATT 


AAG 


GTA 




16 


203 


p54w27 




GGT 


TTT 


ATT 


AAG 


GTA 


A 


16 


204 


p54m27 




GGT 


TTT 


ATT 


AAG 


GTA 


A 


16 


205 


p54m28 


A 


GGT 


TTT 


ATT 


AAG 


GTA 




16 


206 


p54m29 


GA 


GGT 


TTT 


ATT 


AAG 


GT 




16 


207 


p54m30 




GGT 


TTT 


ATT 


AAG 


GTA 


AG 


17 


208 


p54w31 




GGT 


TTT 


ATC 


AAA 


GTA 


A 


16 


209 


p54w32 


A 


GGT 


TTT 


ATC 


AAA 


GTA 


A 


17 


210 


p54w33 


A 


GGT 


TTT 


ATC 


AAA 


GTA 




16 


211 


p54w34 


GA 


GGT 


TTT 


ATC 


AAA 


GT 




16 


212 


p54m35 




GGT 


TTT 


GTC 


AAA 


GTA 




15 


213 


p54m36 




GGT 


TTT 


GTC 


AAA 


GTA 


A 


16 


214 


p54m37 




GGT 


TTT 


GTC 


AGA 


GTA 




15 


215 


p54m38 




GGT 


TTT 


GTC 


AGA 


GTA 


A 


16 


216 


p54w39 




GGG 


TTT 


ATC 


AAA 


GTA 




15 


217 


p54w40 




GGG 


TTT 


ATC 


AAA 


GTA 


A 


16 


218 


p54w41 




GGC 


TTC 


ATC 


AAA 


GT 




14 


219 


p54w42 


GA 


GGC 


TTC 


ATC 


AAA 






14 


220 


p54m48 




GGT 


TTT 


GTC 


AAA 


GT 




14 


221 


p54m4 9 




GT 


TTT 


GTC 


AGA 


GTA 




14 


222 
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p54m50 GGT TTT GTC AGA GT 

p54w51 A GGT TTA ATC AAA GTA 

p54w52 GA GGT TTA ATC AAA GT 

p54m53 GGT TTT ACC AAA GTA 

p54m54 GGT TTT ACC AAA GT 



WO 99/67428 _ PCT/EP99/04317 



39 
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P82wl 

P82w2 

P82w3 

P82w4 

P82w5 

P82w6 

P82w7 

P82w8 

P82w9 

P82wl0 

P82W21 

P82mll 

P82ml2 

P82ml3 

P82ml4 

P82ml5 

P82ml6 

P82ml7 

P82ml8 

P82ml9 

P82m20 

P82m22 

P82m23 

P82m24 

P82m25 

P82m2 6 

P82m27 

P82m28 

P82m2 9 

P82m30 

P82m31 

P82w32 

P82w33 

P82w34 

P82w35 

P82m36 

P82m37 

P82m38 

P82m39 

P82m4 0 

P82m41 

P82w42 

P82w43 

P82w44 

P82w45 

P82w46 

P82m47 



78 79 
GGA CCT 
CCT 
CCT 
CCT 
A CCT 
A CCT 
A CCT 
GA CCT 



CCT 
CCT 
CCT 
A CCT 
A CCT 
A CCT 
GA CCT 



CCT 
CCT 
CCT 



T 
T 



CCT 
T 
CT 



80 
ACA 
ACA 
ACA 
ACA 
ACA 
ACA 
ACA 
ACA 
CA 
A 
ACA 
A 
ACA 
ACA 
ACA 
ACA 
ACA 
ACA 
ACA 
CA 
A 
ACA 
ACA 
ACA 
ACA 
A 
A 
A 
A 
A 
A 
A 
ACA 
ACA 
ACA 
CA 
ACA 
CA 
ACA 
CA 
ACA 
CA 
CA 
CA 
ACA 
ACG 
ACG 
ACA 



81 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 



82 
GTC 
GTC 
GTC 
GTC 
GTC 
GTC 
GTC 
GTC 
GTC 
GTC 
GTC 
GTC 
ACC 
ACC 
ACC 
ACC 
ACC 
ACC 
ACC 
ACC 
ACC 
ACC 
TTC 
GCC 
TCC 
TTC 
GCC 
TTC 
ACC 
TTC 
GCC 
TCC 
GTC 
GTC 
GTC 
GTC 
ACC 
ACC 
TTC 
TTC 
GCC 
GCC 
GTC 
GTC 
GTC 
GTC 
GTC 
TCC 



83 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 



84 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
AT 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
GTA 
GT 



85 

ATT 

AG 

ATG 

ATT 

AG 

ATG 



ATT 

ATT 

ATT 

ATT 

AG 

ATG 

ATT 

AG 

ATG 



86 87 
GGA AGA 



length Seq ID 



GGA 
GGA A 
GG 
GGA 



ATT 
ATT 
ATT 
ATT 
ATT 
ATT 
ATT 
ATT 
ATT 
ATT 
ATT 
ATT 
ATT 



GGA 
GGA A 
G 



GGA A 
GGA A 
GGA A 

GGA 
GGA 
GGA 



AT 
AG 
ATA 



19 


228 


20 


229 


21 


230 


20 


231 


21 


232 


19 


233 


20 


234 


20 


235 


20 


236 


20 


237 


19 


238 


19 


239 


20 


240 


21 


241 


20 


242 


21 


243 


19 


244 


20 


245 


20 


246 


20 


247 


19 


248 


21 


249 


21 


250 


21 


251 


20 


252 


20 


253 


20 


254 


16 


255 


19 


256 


19 


257 


19 


258 


15 


259 


16 


260 


15 


261 


14 


262 


15 


263 


14 


264 


15 


265 


14 


266 


15 


267 


14 


268 


14 


269 


13 


270 


15 


271 


15 


272 


15 


273 


15 


274 
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ffl 



C3 



P82m48 

P82m49 

P82m50 

P82m51 

P82m52 

P82m53 

P82w54 

P82w55 

P82w56 

P82w57 

P82w58 

P82w59 

P82w60 

P82w61 

P82m62 

P82m63 

P82m64 

P82m65 

P82m66 

P82m67 

P82m68 

P82m69 

P82m70 

P82m71 

p82w72 

p82w73 

p82w74 

p82w75 

p82w76 

p82w77 

p82w78 

p82w79 

p82w80 

p82w81 

p82w82 

p82w83 

p82w84 

p82w85 

p82w86 

p82w87 

p82w88 

p82w89 

p82w90 

p82w91 

p82w92 

p82w93 

p82w94 

p82w95 

P82w96 

p82w97 

p82w98 



CA 
ACA 
ACA 
CA 
CA 
A 



A 
A 



A 

ACG 

CA 
ACA 

CA 



CA 
ACA 

CA 
ACA 
A 



CCT ACG 
CT ACG 
CCT ACG 
CCT ACG 
T ACA 
CT ACA 
CCT ACA 
CA 
A 

CT ACA 
ACA 
A 
CA 
ACA 
TCA 
ACA 
CA 

A 

CCT ACA 



CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCG 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CT 
CCT 
CCT 
CT 
CCT 
CCT 
CCT 
CCT 
CCG 
CCG 
CCG 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
CCT 
T 
T 



TCC 
TCC 
ATC 
ATC 
ATC 
ATC 
GTC 
GTT 
GTT 
GTC 
GTC 
GTC 
GTC 
GTC 
GCC 
GCC 
ACC 
ACC 
ACC 
ACC 
TTC 
TTC 
TTC 
TTC 
GTC 
GTC 
GTC 
GTC 
GTC 
GTC 
GTC 
GTC 
GTC 
GTC 
GTC 
GTC 
GTC 
GTC 
GTC 
GTT 
GTT 
GTT 
GTC 
GTC 
GTC 
GTC 
GTC 
GTC 
GTC 
GTC 
GTC 



AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAT 
AAT 
AAT 
AAT 
AAT 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAT 
AAT 
AAT 
AAT 
AA 
AAC 
AA 
A 

AAC 
AA 
A 

AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AAC 
AA 
AAC 
AAC 



ATA 
AT 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
ATA 
AT 
ATA 
ATA 
ATA 
ATA 
ATA 
GTA 
GTA 
GTA 
GT 
GTA 
GTA 
GTA 
GTA 
ATA 
ATA 
ATA 
ATA 



AG 

ATG 

ATG 

ATT 

ATT G 

ATG 

ATT 

ATT 

ATG 

AG 

ATT 

AG 

ATT 

ATG 



ATT 
AG 

ATG 
ATT G 
ATT G 
ATT 

ATT GG 



ATA A. 
ATA AT 
A 
AT 

ATA ATT 
ATA AG 
ATA AG 
ATA 
ATA A 
ATA AT 
ATA ATT 
ATA ATT 
ATA ATT 



ATA ATT GG 
ATA ATT GGA 



14 

14 

15 

15 

16 

15 

15 

16 

15 

15 

14 

15 

16 

16 

15 

15 

15 

14 

14 

14 

15 

15 

15 

15 

15 

16 

16 

16 

14 

14 

15 

14 

14 

13 

13 

15 

15 

15 

14 

17 

15 

16 

14 

16 

16 

15 

16 

16 

14 

15 

16 



275 

276 

277 

278 

279 

280 

281 

282 

283 

284 

285 

286 

287 

288 

289 

290 

291 

292 

293 

294 

295 

296 

297 

298 

299 

300 

301 

302 

303 

304 

305 

306 

307 

308 

309 

310 

311 

312 

313 

314 

315 

316 

317 

318 

319 

320 

321 

322 

323 

324 

325 
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Table 3 - Cont'd-9 



p82m99 




AP A 


PPT 


TTP 


p82ml00 


T 


AP A 


PPT 
<^L 1 


TTP 


p82ml01 




AP A 


PPT 


7\ TP 
/A 1 L 


P82ml02 




AP A 


PPT 


ATP 


p82ml03 




PA 


PPT 


PPP 


D82ml04 




AP A 


PPT 


PPP 

LLL 


o82ml05 




APP 


LLL 


TTP 


D82ml06 




PP 


ppp 


rnmp 


D82ml07 


T 


app 


ppp 




r>R2w1 Oft 






LLL 


/■""•» rp /—» 

LTC 


U <- W X v ^ 




a r* a 


LLL 




d82w110 




/A 


LLL 


L 1 O 


o82wlll 

K*' VV ^L. _I_ 




A 


LLL 


L 1 L 


p82wll2 


pip 
^ J. 


AP A 


PP A 


blL 


p82wll3 


CT 


AP A 


PP A 


PTP 
L 1 L 


p82wll4 




AP A 


PP Zi 
LL/i 


L 1 L 


p82wll5 




APA 


PP TV 




p82wll6 




APP 


PPT 
^L. 1 


PTP 
L 1 L 


P82wll7 

Jp-' w VV _i_ _1_ f 




A PP 


PPT 1 
LL 1 


bit 


p82wll8 


T 




PPT 


PTP 


p82mll9 


CCT 


ACA 


CCT 


TTC 


p82ml20 


CT 


ACA 


CCT 


TTC 


p82ml21 


A CCT 


ACA 


CCT 


TTC 


p82wl22 




ACG 


CCT 


GTC 


p82wl23 


T 


ACG 


CCT 


GTC 


p82wl24 




CG 


CCT 


GTC 


p82ml25 


T 


ACA 


CCT 


TTC 


p82ml26 




ACA 


CCT 


TTC 


p82ml27 




CA 


CCT 


TTC 


p82ml28 




A 


CCT 


TTC 


p82ol29 








C 


p82ol30 








C 



ML 


ATA A 




16 


326 


TV 7\ r* 


ATA 




16 


327 


7\ 7\ 


ATA ATG 




17 


328 


7\ 7\ /— < 
AAL 


ATA AG 




16 


329 


AAl 


ATA ATG 




16 


330 


AA1 


ATA AG 




16 


331 


7\ 7\ /— 1 

AAC 


ATA 




15 


332 


AAC 


ATA AG 




15 


333 


7\ TV /~» 

AAC 


AT , 




15 


334 


A AP 






14 


335 


7A TV 

AA 






14 


336 


AAC 


ATA ATG 




15 


337 


AAL 


ATA ATT 




16 


338 


AAC 






14 


339 


T\ TA P 
AHL 


A 




15 


340 


7\ TV /"» 

AAC 


ATA 




15 


341 


TV TV /"» 

AAL 


ATA AG 




16 


342 


TA TV 

AAL 


AT 




15 


343 


TV TV P 

AAL 


ATA 




15 


344 


A AP 


7\ 
■ft 




14 


345 


AAC 






15 


346 


AAC 






14 


347 


AA 






15 


348 


AAC 


ATA AGG 




16 


349 


AAC 


ATA 




16 


350 


AAC 


ATA AGG 




15 


351 


AAC 


GTA 




16 


352 


AAC 


GTA AGG 




16 


353 


AAC 


GTA ATG 




16 


354 


AAC 


GTA ATT 




16 


355 


AAC 


GTA ATT GGA 


AGA 


16 


356 


AAC 


GTA ATT GGA 


AG 


15 


357 
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Table3-Cont'd-10 





86 


87 


88 


89 


90 


91 


92 


93 


94 


lenath Sea I 




GGA 


AGA 


AAT 


CTG 


TTG 


ACT 


CAG 


ATT 


GGT 






P90wl 




A 


AAT 


CTG 


TTG 


ACT 


CAG 






16 


358 


P90w2 




GA 


AAT 


CTG 


TTG 


ACT 


CAG 






17 


359 


P90w3 




GA 


AAT 


CTG 


TTG 


ACT 


CAG 


AGG 




18 


360 


P90w4 




A 


AAT 


CTG 


TTG 


ACT 


CAG 


AGG 




' 17 


361 


P90w5 




AGA 


AAT 


CTG 


TTG 


ACT 


CAG 


AGG 




19 


362 


P90w6 




AGA 


AAT 


CTG 


TTG 


ACT 


CAG 


ATG 




20 


363 


P90w7 




AGA 


AAT 


CTG 


TTG 


ACT 


CAG 


ATT 




21 


364 


P90w8 


AGA AAT CTG TTG ACT CAG ATTGG 




20 


365 


P90w9 


GA 


AGA 


AAT 


CTG 


TTG 


ACT 


CAG 


AGG 




21 


366 


P90wl0 


A 


AGA 


AAT 


CTG 


TTG 


ACT 


CAG 


ATG 




21 


367 


P90mll 




AGA 


AAT 


CTG 


ATG 


ACT 


CAG 


ATG 




20 


368 


P90ml2 




AGA 


AAT 


CTG 


ATG 


ACT 


CAG 


ATT 




21 


369 


P90ml3 


A 


AGA 


AAT 


CTG 


ATG 


ACT 


CAG 


AGG 




20 


370 


P90ml4 


GA 


AGA 


AAT 


CTG 


ATG 


ACT 


CAG 


AGG 




21 


371 


P90ml5 


A 


AGA 


AAT 


CTG 


ATG 


ACT 


CAG 


ATG 




21 


372 


P90ml6 


GA 


AGA 


AAT 


CTG 


ATG 


ACT 


CAG 


ATT 




20 


373 


P90ml7 


GGA 


AGA 


AAT 


CTG 


ATG 


ACT 


CAG 






21 


374 


P90ml8 


A 


AGA 


AAT 


CTG 


ATG 


ACT 


CAG 






19 


375 


P90ml9 




A 


AAT 


CTG 


ATG 


ACT 


CAG 


ATT 


GG 


21 


376 


P90m20 




A 


AAT 


CTG 


ATG 


ACT 


CAG 


ATT 


G 


20 


377 


P90m21 




A 


AAT 


CTG 


ATG 


ACT 


CAG 


CTT 


G 


20 


378 


P90m22 




A 


AAT 


CTG 


ATG 


ACT 


CAG 


CTT 




19 


379 


P90m23 






AAT 


CTG 


ATG 


ACT 


CAG 


CTT 


G 


18 


380 


P90w24 




A 


AAT 


CTG 


TTG 


ACT 


CAG 


CTT 


G 


20 


381 


P90w25 




A 


AAT 


CTG 


TTG 


ACT 


CAG 


CTT 




19 


382 


P90w26 






AAT 


CTG 


TTG 


ACT 


CAG 


CTT 


G 


19 


383 


P90w27 






AAT 


CTG 


TTG 


ACT 


CA 






14 


384 


P90w28 






AAT 


CTG 


TTG 


ACT 


CAG 






15 


385 


P90w29 




A 


AAT 


CTG 


TTG 


ACT 


CA 






15 


386 


P90w30 




A 


AAT 


CTG 


TTG 


ACT 


CAG 






16 


387 


P90m31 






AAT 


CTG 


ATG 


ACT 


CA 






14 


388 


P90m32 






AAT 


CTG 


ATG 


ACT 


CAG 






15 


389 


P90m33 




A 


AAT 


CTG 


ATG 


ACT 


CA 






15 


390 


P90m34 




A 


AAT 


CTG 


ATG 


ACT 


CAG 






16 


391 


P90w35 




GA 


AAT 


CTG 


TTG 


ACT 


C 






15 


392 


P90w36 




GA 


ACT 


CTG 


TTG 


ACT 


C 






15 


393 


P90w37 






T 


CTG 


TTG 


ACT 


CAG 


ATG 




15 


394 


P90w38 




GA 


AAT 


CTG 


TTG 


ACT 


C 






15 


395 


P90w39 




GA 


ACT 


CTG 


TTG 


ACT 


C 






15 


396 


P90w40 




A 


AAT 


CTG 


TTG 


ACT 


CA 






15 


397 


P90w41 






AAT 


CTG 


TTG 


ACT 


CAG 






15 


398 


P90m42 






AAT 


CTG 


ATG 


ACT 


CAG 






15 


399 


P90m43 




A 


AAT 


CTG 


ATG 


ACT 


CA 






15 


400 


P90w44 






AT 


CTG 


TTG 


ACT 


CAG 


AG 




15 


401 


P90w45 








CTG 


TTG 


ACT 


CAG 


ATT 




15 


402 


P90w46 




AGA 


AAT 


CTG 


TTG 


ACT 








15 


403 


P90m47 






AT 


CTG 


ATG 


ACT 


CAG 


AG 




15 


404 
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Table3-Cont'd-ll 



P90m48 






CTG 


ATG 


ACT 


CAG 


ATT 




15 


405 


P90m4 9 


AGA 


AAT 


CTG 


ATG 


ACT 


CA 






17 


406 


P90w50 




AAT 


ATG 


TTG 


ACT 


CAG 






15 


407 


P90w51 


GA 


AAT 


ATG 


TTG 


ACT 


CA 






16 


408 


P90w52 




AAT 


TTG 


TTG 


ACT 


CAG 






15 


409 


P90w53 


GA 


AAT 


TTG 


TTG 


ACT 


CA 






16 


410 


P90w54 




AAT 


ATG 


TTG 


ACC 


CAG 






15 


411 


P90w55 


A 


AAT 


ATG 


TTG 


ACC 


CA 






15 


412 


P90m56 




AAT 


ATG 


ATG 


ACC 


CAG 






15 


413 


P90m57 


A 


CAG 


ATG 


ATG 


ACC 


CA 






15 


414 


P90w58 




AAC 


ATG 


TTG 


ACT 


CAG 






15 


415 


P90w59 


A 


AAC 


ATG 


TTG 


ACT 


CAG 






15 


416 


P90w60 






TG 


TTG 


ACT 


CAG 


CTT 




14 


417 


P90w61 






CTG 


TTG 


ACT 


CAG 


CTG 




14 


418 


P90m62 






CT 


ATG 


ACT 


CAG 


CTT 




14 


419 


P90m63 






CTG 


ATG 


ACT 


CAG 


C-G 




14 


420 


P90w64 






TG 


ACT 


ACA 


CAG 


CTT 




14 


421 


P90w65 






CTG 


TTG 


ACA 


CAG 


C-G 




14 


422 


P90w66 




AAT 


CTG 


TTG 


ACA 


CAG 






15 


423 


P90w67 




AAC 


CTG 


TTG 


ACT 


CA 






13 


424 


P90w68 


A 


AAC 


CTG 


TTG 


ACT 


C 






13 


425 


P90w69 


GA 


AAC 


CTG 


TTG 


ACT 








13 


426 


p90w70 






TG 


TTG 


ACT 


CAG 


ATT 


G 


15 


427 


p90w71 






TG 


TTG 


ACT 


CAG 


ATT 


GGG 


16 


428 


p90w72 






G 


TTG 


ACT 


CAG 


ATT 


GGG 


15 


429 


p90w73 






TG 


TTG 


ACA 


CAG 


CTT 


G 


15 


430 


p90w74 






CTG 


TTG 


ACA 


CAG 


CTT 




15 


431 


p90w75 






G 


TTG 


ACA 


CAG 


CTT 


GGG 


15 


432 


p90w76 






TG 


TTG 


ACT 


CAG 


CTT 


G 


15 


433 


p90w77 






G 


TTG 


ACT 


CAG 


ATG 




15 


434 


p90w78 






G 


TTG 


ACT 


CAG 


CTT 


G 


14 


435 


p90w79 






TG 


TTG 


ACC 


CAG 


ATT 


G 


15 


436 


p90w80 






G 


TTG 


ACC 


CAG 


ATT 


G 


14 


437 


p90w81 






G 


TTG 


ACC 


CAG 


ATT 


GGG 


15 


438 


p90m82 






TG 


ATG 


ACT 


CAG 


ATT 


G 


15 


439 


p90m83 






TG 


ATG 


ACT 


CAG 


ATT 


GGG 


16 


440 


p90m84 






G 


ATG 


ACT 


CAG 


ATT 


GGG 


15 


441 


p90m85 






G 


ATG 


ACT 


CAG 


ATT 


GGT 


16 


442 


p90m86 






CTG 


ATG 


ACT 


CAG 


CTT 




15 


443 


p90m87 






TG 


ATG 


ACT 


CAG 


CTT 


G 


15 


444 


P90w88 


A 


AAT 


CTG 


TTG 


ACT 


CA 






15 


445 


P90w89 


A 


AAT 


CTG 


TTG 


ACT 


CA 






15 


446 


p90w90 


A 


AAT 


CTG 


TTG 


ACT 


CA 






15 


447 


p90wl00 




AAT 


CTG 


ATG 


ACT 


CAG 






15 


448 


p90m92 


A 


AAT 


CTG 


ATG 


ACT 


CA 






16 


449 


p90m93 


GA 


AAT 


CTG 


ATG 


ACT 


C 






15 


450 


p90m94 




CTG ATG ACT CAG ATG 




15 


451 


p90m95 


AGA 


AAT 


ATG 


ATG 










15 


452 


p90m96 


A AGA 


AAT 


ATG 


ATG 


ACT 








16 


453 
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p90m97 


A 


AGA 


AAT 


CTG 


ATG 


ACT 






16 


454 


p90m98 


A 


AGA 


AAT 


ATA 


ATG 


ACT 






16 


455 


p90m99 




A 


AAT 


ATA 


ATG 


ACT 


CAG 




16 


456 


p90ml00 






AAT 


ATG 


ATG 


ACC 


CAG 




15 


457 


p90ml01 






AAC 


CTG 


ATG 


ACT 


CAG 




15 


458 


p90ml02 




AGA 


AAT 


TTG 


ATG 


ACT 


C 




16 


459 


p90ml03 




A 


AAT 


TTG 


ATG 


ACT 


ATG 


ACT 


16 


460 


p90ml04 






AC 


CTG 


ATG 


ACT 


CAG 




14 


461 


p90ml05 






AAT 


CTG 


ATG 


ACT 


CAG 


A 


16 


462 


p90ml06 






AT 


CTG 


ATG 


ACT 


CAG 


ATG 


16 


463 


p90ml07 






AT 


CTG 


ATG 


ACT 


CAG 




14 


464 


p90ml08 








CTG 


ATG 


ACT 


CAG 


ATT G 


16 


465 


p90ml09 




AGA 


AAT 


CTG 


ATG 


ACT 


C 




16 


466 


p90mll0 




AGA 


AAT 


CTG 


ATG 


ACT 






15 


467 


p90mlll 


GA 


AGA 


AAT 


CTG 


ATG 


A 






15 


468 


p90mll2 


GGA 


AGA 


AAT 


CTG 


ATG 


A 






16 


469 


p90mll3 


GA 


AGA 


AAT 


CTG 


ATG 


AC 






16 


470 


p90mll4 




AGA 


AAT 


CTG 


ATG 


AC 






14 


471 


p90wll5 






AAT 


CTG 


TTA 


ACT 


CAG 




15 


472 


p90wll6 






T 


CTG 


TTA 


ACT 


CAG 


ATT 


16 


473 


p90wll7 






AT 


CTG 


TTA 


ACT 


CAG 


AG 


15 


474 


p90wll8 




AGA 


AAT 


TTG 


TTG 


ACT 






16 


475 


p90wll9 




GA 


AAT 


TTG 


TTG 


ACT 


C 




15 


476 


p90wl20 






AAT 


TTG 


TTG 


ACT 


CAG 




15 


477 



Iff 
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non- 


00 

<J\ 


CM 






o 








m 


















& 


\> 
in 


rH 
rH 


LD 
CO 


rH 
rH 


rH 
rH 




0 

\l 




probes for 
codon p50 


m 




CN 
LO 


I> 

B 


• 

CD 
d> 
CJ 




m 


















o 

bj 




non- 


o 
r- 


CN 
CN 




o 


CO 










m 

rH 


rH 

, — i 




CN 

ro 


o 






in 


































I 


H 


probes for 
codon p48 


r- 


LD 


CN 

r- 


rH 

B 


neg. 








PQ 


















non- 


CO 


O 


rH 


o 


o 


rH 
























& 


r- 

ITi 


i— 1 
rH 


rH 
rH 


rH 
rH 


rH 
rH 


O 






probes for 
codon p30 


IT) 
CN 
5 


<J\ 

CN 

3: 


CN 

ro 


ro 


ro 
CN 
B 


neg. 
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non-B 


LO 
CN 


17.5 


o 


m 

VD 


LO 
CN 


LO 
CN 


22.5 


o 

rH 


LO 


LO 
CN 


12.5 










Type B 


O 

Ln 


rH 

VD 
VD 


rH 


LO 

CN 
rH 


rH 


LO 


in 


O 


VD 

CTi 
rH 


O 


VD 

ro 






M 

m 




probes for 
codon p90 


c 

CN 


ro 


o\ 
ro 


LO 


CN 
LO 


CTi 
VD 


ro 
{> 
3s 


CA 

r- 
3: 


ro 
B 


VD 
LO 

B 


neg. 










CQ 




























w 




non- 


o 
o 


CN 

rH 


o 


O 
rH 


CN 


o 


o 


CN 


o 


o 


CN CN 


CN 
rH 


CO 


U1 






























ni 


:ont'd 


Type B 


\J\ 

rH 
CO 


rH 
CN 


rH 
rH 


rH 
rH 


ro 


rH 
CN 


rH 
rH 


rH 
CN 


rH 
rH 


rH 
rH 


14.9 
3.2 


rH 
CN 


CN 

ro 


01 
































Q 


Table 5 - 


probes for 
codon p82/84 


rH 

U 1 

3£ 


o 


Will 


G\ 
CO 


CN 


VD 

ro 


VD 

B 


CO 

ro 
B 


LO 
O 
rH 

B 


r*- 

CN 
rH 
B 


o ro 
^ vo 
B B 


mlOl 


neg. 






CQ 
































non- 


CO 


CN 

vo 


CO 
rH 


o 


CO 






o 














































Type B 


ro 
rH 

r- 


CA 

rH 
CO 


CN 
ro 


VD 


ro 


o 


o 


CN 

ro 


rH 


rH 
rH 


o 










probes for 
codon p54 


ro 


ro 
3* 


rH 

3* 


CA 

rH 

5 


CN 
CN 


VD 

CN 

2 


o 

CN 
3? 


LO 
LO 

B 


Ln 
ro 
E 


ro 
B 


neg . 
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Table 6 



p30 


USA 


France 


U.K. 


Brazil 


Spain 


Luxemb . 


Belgium 


w25 


98.9 


99. 4 


88. 9 


98.3 


94 . 3 


100.0 


97.0 


w2 9 


2.5 


0.6 


0.0 


1.7 


0.0 


0.0 


0.0 


W j/ 




U . b 


o . to 


3 . Z 


D . 7 


6 . 7 


1 . 5 


w36 


2.5 


0.0 


0.0 


3.4 


0.0 


0.0 


1.0 


m23 


3.1 


0.0 


0.0 


0.0 


0.0 


0.0 


1.0 


neg. 


0.6 


0 . 6 


5.6 


0.0 


0.0 


0.0 


1.0 


p46/48 


USA 


France 


U.K. 


Brazil 


Spain 


Luxemb. 


Belgium 


w47 


94 .2 


80.5 


83.3 


89.7 


97. 1 


73.3 


82.9 


W *i *J 


ft 6 








D . / 


6 . 7 


11.1 


w72 


4.2 


0.0 


16.7 


0.0 


2.9 


13.3 


5.0 


m41 


0.0 


0.0 


0.0 


10.3 


0.0 


13. 3 


1.0 


neg . 


2.8 


4 . 5 


0.0 


0.0 


0.0 


0.0 


2.5 


p50 


USA 


France 


U.K. 


Brazil 


Spain 


Luxemb . 


Belgium 


w31 


96.4 


97.4 


100.0 


96. 6 


100. 0 


100.0 


96.5 


w44 


1.7 


0.6 


0.0 


1.7 


0 . 0 


0.0 


1.0 


w52 


10.0 


4.5 


0.0 


1.7 


2.9 


6.7 


9.0 


m37 


2.5 


0.6 


0.0 


1.7 


0.0 


6.7 


0.5 


neg . 


3.1 


2.6 


0.0 


3.4 


0.0 


0.0 


1.5 


p54 


USA 


France 


U.K. 


Brazil 


Spain 


Luxemb . 


Belgium 






o / . o 


y / . Z 


o / . y 


100.0 


53 . 3 


89.4 


w3 


84.7 


77 . 9 


94 . 4 


69. 0 


82. 9 


46.7 


76.9 


wl4 


3.3 


5.8 


0.0 


3.4 


11.4 


0.0 


6.5 


wl9 


9.2 


2.6 


0.0 


1.7 


2.9 


6.7 


5.5 


w22 


2.8 


10.4 


0.0 


0.0 


5.7 


0.0 


2.5 


w2 6 


0.0 


1.3 


0.0 


0.0 


0.0 


0.0 


0.0 


w27 


0.0 


1.9 


0.0 


0.0 


0.0 


0.0 


0.5 


m55 


0.0 


0.0 


0.0 


0.0 


0.0 


13.3 


0.5 


m35 


1.1 


0.0 


2 . 8 


6.9 


0.0 


46.7 


3.0 


m37 


0.0 


0.0 


0.0 


0.0 


0.0 


13.3 


0.0 


neg. 


0.6 


1.3 


0.0 


1.7 


0.0 


0.0 


2.0 


p82/84 


USA 


France 


U.K. 


Brazil 


Spain 


Luxemb . 


Belgium 


w91 


91.6 


93.5 


94 .4 


77.6 


100. 0 


73. 3 


85.9 


w60 


6.4 


2.6 


0.0 


1.7 


2.9 


13.3 


5.5 


will 


3.6 


0.6 


0.0 


1 . 7 


0.0 


0.0 


0.5 


w89 


7.0 


1.9 


0.0 


3.4 


0.0 


0.0 


3.0 


w42 


0.6 


0.0 


2 . 8 


1.7 


0.0 


0.0 


2.0 


m3 6 


0.3 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


m67 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.5 
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Table 6 - Cont'd 



m38 


0.0 


0.0 


0.0 


0.0 


0.0 


6.7 


0.0 


ml05 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


ml27 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


m4 0 


2.8 


0.0 


8.3 


3.4 


5.7 


46.7 


0.0 


m63 


0.3 


0.0 


0.0 


1.7 


2.9 


13.3 


0.5 


mlOl 


1.9 


4 . 5 


0.0 


3.4 


0.0 


6.7 


4.0 


neg . 


2.5 


3.9 


0.0 


13.8 


0.0 


6.7 


5.0 


p90 


USA 


France 


U.K. 


Brazil 


Spain 


Belgium 




w27 


51. 1 


45.5 


34.3 


47.7 


52. 8 


25.7 




w37 


91. 9 


73.4 


80.0 


81.8 


88.9 


55.2 




w39 


0.0 


0.0 


0.0 


0.0 


0.0 


2.9 




w50 


2.6 


23.8 


2.9 


13.6 


11. 1 


21.9 




w52 


8.4 


11.2 


5.7 


6.8 


13. 9 


4 . 8 




w69 


5.2 


1.4 


5.7 


2.3 


0.0 


3. 8 




w73 


6.1 


9.1 


0.0 


0.0 


8.3 


6.7 




w7 9 


7.1 


11.2 


8 . 6 


9.1 


5.6 


5.7 




m43 


1.9 


0.0 


11.4 


0.0 


0.0 


8.6 




m56 


0.3 


1.4 


0.0 


0.0 


0.0 


0.0 




neg . 


1.0 


0.0 


0.0 


0 . 0 


0.0 


18.1 
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Table 7 

Tm lengte Seq ID 



pc50w5 


AGG 


GGG 


AAT 


TGG 


AGG 


TTT 


TA 










20 


511 




26 


27 


28 


29 


30 


31 


32 


33 


34 


35 










ACA 


GGA 


GCA 


GAT 


GAT 


ACA 


GTA 


TTA 


GAA 


GAA 








pc30w25 






GCA 


GAT 


GAT 


ACA 


GT 








40 


14 


31 


pc30w29 




A 


GCG 


GAT 


GAT 


ACA 










36 


13 


35 


pc30w32 






GCA 


GAT 


GAC 


ACA 


GT 








42 


14 


38 


pc30w36 






GCA 


GAC 


GAT 


ACA 


GG 








40 


14 


42 


pc30m23 




A 


GCA 


GAT 


AAT 


ACA 


GT 








40 


15 


29 




44 


45 


46 


47 


48 


49 


50 


51 


52 












CCA 


AAA 


ATG 


ATA 


GGG 


GGA 


ATT 


GGA 


GGT 










pc48w37 






ATG 


ATA 


GGG 


GGA 


ATT 










15 


512 


pc48w47 




AAA 


ATG 


ATA 


GGG 


GGA 










42 


15 


93 


pc48w73 


A 


AGA 


ATG 


ATA 


GGG 


G 












14 


513 


pc48w45 




AAA 


ATG 


ATA 


GGA 


GGA 


ATT 








42 


18 


91 


pc48w72 


A 


AAA 


ATA 


ATA 


GGG 


GGA 










42 


16 


120 


pc4 8m41 






ATG 


ATA 


GTG 


GGA 


ATT 








40 


15 


87 





48 


49 


50 


51 


52 


53 


54 










GGG 


GGA 


ATT 


GGA 


GGT 


TTT 


ATC 








pc50w31 




GGA 


ATT 


GGA 


GGT 


TTT 




42 


15 


151 


pc50w44 




GGA 


ATT 


GGG 


GGT 


TT 




42 


14 


164 


pc50w52 




GA 


ATT 


GGA 


GGC 


TTG 






14 


172 


pc50m37 


GGG 


GGA 


GTT 


GGA 








40 


12 


157 




51 


52 


53 


54 


55 


56 


57 58 










GGA 


GGT 


TTT 


ATC 


AAA 


GTA 


AGA CAG 








pc54w34 


GA 


GGT 


TTT 


ATC 


AAA 


GT 




42 


16 


212 


pc54wl4 




GGT 


TTT 


ATC 


AAG 


GTA 


A 


42 


16 


189 


pc54wl9 


A 


GGC 


TTT 


ATC 


AAA 


GTA 




42 


16 


194 


pc54w22 


GA 


GGT 


TTT 


ATT 


AAA 


GTA 




42 


17 


197 


pc54w26 


A 


GGT 


TTC 


ATT 


AAG 


GTA 




42 


16 


202 


pc54w27 




GGT 


TTT 


ATT 


AAG 


GTA 


A 


40 


16 


204 


pc54m35 




GGT 


TTT 


GTC 


AAA 


GTA 




40 


15 


213 


pc54m37 




GGT 


TTT 


GTC 


AGA 


GTA 




42 


15 


215 


pc54m55 


A 


GGT 


TTT 


GCC 


AAA 


GT 






15 


516 





78 


79 


80 


81 


82 


83 


84 


85 


86 


87 










GGA 


CCT 


ACA 


CCT 


GTC 


AAC 


ATA 


ATT 


GGA 


AGA 








pc82w91 






ACA 


CCT 


GTC 


AAC 


ATA 


A 






44 


16 


318 


pc82w60 






CA 


CCT 


GTC 


AAT 


ATA 


ATG 






42 


17 


287 


pc82wlll 






A 


CCG 


GTC 


AAC 


ATA 


ATT 






44 


16 


338 


pc82w89 






ACA 


CCT 


GTT 


AAC 


ATA 


AG 






42 


17 


316 


pc82ml01 






ACA 


CCT 


ATC 


AAC 


ATA 


AT 








17 


517 


pc82w42 






CA 


CCT 


GTC 


AAC 


GTA 








42 


14 


269 


pc82m38 






ACA 


CCT 


TTC 


AAC 


ATA 








40 


15 


265 


pc82ml05 






ACG 


CCC 


TTC 


AAC 


ATA 








44 


15 


332 


pc82ml27 






CA 


CCT 


TTC 


AAC 


GTA 


ATG 






44 


17 


354 
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Table 7 - Cont'd 



pc82m40 
pc82m63 
pc82m36 
pc82m67 



ACA CCT GCC AAC ATA 

CA CCT GCC AAT ATA AG 

ACA CCT ACC AAC ATA 

ACA CCT ACC AAC GT 





86 


87 


88 


89 


90 


91 


92 




GGA 


AGA 


AAT 


CTG 


TTG 


ACT 


CAG 


pc90w27 






AAT 


CTG 


TTG 


ACT 


CA 


pc90w37 






T 


CTG 


TTG 


ACT 


CAG 


pc90w39 




GA 


GTC 


AAC 


AGA 


GTT 


C 


pc90w50 






AAT 


ATG 


TTG 


ACT 


CAG 


pc90w52 






AAT 


TTG 


TTG 


ACT 


CAG 


pc90w69 




GA 


AAC 


CTG 


TTG 


ACT 




pc90w73 








TG 


TTG 


ACA 


CAG 


pc90w79 








TG 


TTG 


ACC 


CAG 


pc90ml38 




GTC ATC AGA TTT CT 




pc90m56 






AAT 


ATG 


ATG 


ACC 


CAG 



93 94 



G 
G 



A A 


15 


267 


42 


16 


290 




15 


518 




14 


519 


38 


14 


384 




15 


514 




15 


515 


40 


15 


407 


40 


15 


409 


40 


14 


426 


44 


15 


430 


44 


15 


436 




14 


510 


42 


15 


413 



